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Erythropoietin Muteins With Enhanced Activity 

This invention was made with Government Support under grant nos. 
R01-HU2949 and R01-GM39900 sponsored by the National Institute of 
Health. The Government has certain rights in the invention. 

s Cross-Reference to Related AppUcadons 

This application is a continuation-in-part of U.S. Patent Application 
Serial No. 08/049,802 filed April 2 1, 1993 which is incorporated herein by 
reference. 

Background 

10 This invention relates to erythropoietin in gei^ral and more particularly 

to modified erythropoietin proteins (erythropoiedn muteins) having improved 
biological activity. 

Erythropoietin (hereafter referred to as Epo) is a glycoprotein hormone 
which, in its mature form in humans, is 166 amino acids long ^tfa a 

IS molecular weight of34 to 38 kD (Jacobs efa/.»iVar^^ 

Epo occurs in a, fi, and asialo forms whidi differ slightly in dieir 
caifoohydrate composition and biological activity (Dordal etal.. Endocrinology 
776:2293-2299(1985)). 

Id terms of biological function, Epo has been leoogmzed as tat 

20 hematopoietic cytoldne that regulates tfiepn)oe8s of red blood c^^^ 

known as erythropoiesis. Eiythropoieds is a controlled physiological process 
whidi normally produces red blood cells in numbers whidi do not impede 
bkxxl flow, but which are sufficient for oxygen transit 

The binding of ^ to its cognate recqnor 0)'Andrej^ A,Dm €r fl/.. 

25 CeU 57:277-285 (1989)) on eiythroid precursor cells in the bone marrow 
results in salvaging these cells from programmed cell death, known as 
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apoptosis (Koury, MJ.. et al.. Science 2^:378-381 (1990)), allowing them 
to proliferate and differentiate into circulating erythrocytes (red blood cells). 
Epo thus facilitates the formation of erythrocytes from erythroid precursor 
cells. 

S The level of Epo in circulation normally controls the rate of red blood 

cell production in the body. Typically, Bpo is present at a low plasma 
concentration sufficient to maintain a steady state concentration of red blood ■ 
cells by stimulating formation of just enough red blood cells to replace those 
lost through normal processes, such as aging. However, when an increase in 

10 the production of red blood cells is needed, the amount of Epo in circulation 
is increased. This red blood cell deficit may be caused, for example, by 
anemia, the general loss of blood through hemorrhage, over-exposure to 
radiation, prolonged unconsciousness, or exposure to high altitudes where 
oxygen intake is reduced. 

IS In mammals, Epo is produced in the fetal liver and adult kidney, 

circulates in the bloodstream, and binds to receptors on committed progenitor 
cells in the bone marrow and other hematopoietic tissues, resulting in 
proliferation and terminal maturation of erythroid cells (Jelknumn, W., 
Physiol. Rev. 72:449 (1992)). The expression of Epo, both mRNA and 

20 protein, is markedly increased by hypoxia, owing to a 3' enhancer and to 
highly conserved elemrats in die promote region (Semenza, G.U et al.^ 
Prvc. Natl. Acad. Sd. 104 SS:S680-1684 (1988); Beck, I. etal., J. Biol, 
aim. 266:15563-15566 (1991); Pugh, C.W. et al., Ptvc. Natl. Acad. Sd. 
USA S8:10553-l(»57: Blanchaid;K.L. et al.. Md. CeUBiol. 72:5373-5385 

25 (1992)). This elegam servomechanism enables ^ to Relate the led oe^^ 
mass of man and oAer aiumals. 

Genes encoding Epo from a number of species have been studied. 
Thus &r, the ^ genes of man (Jacobs, K. er al.^ Nature Ji5:806-10 (1985); 
Un. F.'K. etal., Pwc. Natl Acad. Sd. USA 82:7580-7584 (1985)), a 

30 monkey (Mzcaai/asdctftsm 

a rodent, the mouse (McDonald, J.D. et al., Mol. & Cdl BoL 6:842-848 



wo 94/24160 



PCTAJS94/04361 



(1986); Shoemaker, C.B. et aL, MoL & Cell Biol. 6:849-858 (1986)) have 
been cloned, sequenced, and expressed. There is a high degree of sequence 
homology in the coding region of the mature secreted proteins from these 
species. 

S In particular, the gene encoding human Epo has been cloned and used 

to produce recombinant Epo in cell culture. Recombinantly produced human 
Epo has general utility as a substitute for Epo isolated from natural sources. 
In addition, recombinant Epo has facilitated the production of large amounts 
of Epo free from deleterious substances associated with naturally derived Epo, 

10 which has allowed for greater avmlability and utility of this protein. 

Recombinandy produced Epo has proven eq>ecially useful for the 
treatment of patients suffering from impaired red blood cell production 
(Physicians Desk Reference (PDR), 1993 edition, pp QQl-GOS). Recombinant 
Epo has proven effective in treating anemia associated with chronic renal 

IS failure and HIV-Infiu:ted individuals suffering from lowered endogenous Epo 
levels related to dierapy witii Zidovudine (AZT) (See PDR, 1993 edition, at 
page 602). 

Modifications of die Epo protein which would improve its utility as a 
tool for diagnosis or treatment of blood disorders are certainly desirable. In 

20 particular, modified forms of Epo exhibiting enhanced biological activity 
w(Hilfl be more effective and efficient dian native Epo in die therapy setdng 
when it is necessary to administer Epo to the patient, enabling administration 
less firequendy and/or at a lower dose. Administration of reduced amounts of 
Epo would also presumably reduce die risk of adverse effects associated widi 

25 Epo treatnient, sadk as hypertension, sdzures, headaches, etc. (See PDR, 
1993 edition, at pp. 603-604). 

Unfortunately, available information r^ardii^ the strucoire of the Epo 
enzyme and its function does not permit die accurate prediction of 
modifications whidi would result in such improvements. In fact, die highly 

30 conserved nature of die amino add sequence of Epo indicates diatdiis protein 
is, for die most part, resistant to change and diat most modifications would be 
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expected to have deleterious effects on Epo and its activity (See U.S. Patent 
No. 4,835,260 by Shoemaker). 

Some speculation does, however, exist regarding the possibility of 
creating variants of Epo with altered characteristics that would be preferable 
5 to the wild type protein for particular purposes. A number of modifications 
of Epo have been proposed, including deletions, additions, and substitution of 
existing amino acids. These modifications have been proposed as a means of - 
achieving a variety of effects, including separation of various Epo functions 
on individual protein fragments, improving die efficacy, of recombinant 1^ 

10 production, improving in \ivo stability, etc. Of particular relevance to the 
present invention are those mcklifications proposed to improve the biological 
activity (i.e. ability to stimulate red blood cell production) of Epo. 

U.S. Patent No. 4,703,008 by Lin, F-K. (hereinafter referred to as 
"tiie '008 patent") speculates about a wide variety of modifications of Epo, 

IS including addition, deletion, and substitution analogs of Epo. The '008 patent 
does not indicate that any of die suggested modifications would increase 
biological activity per se, altiiough it is stated that deletion of glycosylation 
sites might increase die activity of Epo produced in yeast (See die *008 patent 
. at column 37, lines 25-28). Also, die *008 patent speculates fliat Epo analogs 

20 which have one or more ^rosine residues replaced widi phenylalaiiine may 
exl^bit an increased or decreased recq>tor binding affiniQf. 

Au^ian Patent Application No. AU*A-S9145/90 by Fibi, M a al. 
(herein referred lo as *Fibi") also discusses a number of modified Epo proteins 
(Epo muteins). Fibi generally q)eculaies about die alteration of amino adds 

25 10-55. 70-85, and 130-166 of Epo. In particular, additions of positively 
charged basic amino acids in die caiboxyl terminal region are purported to 
increase die biological activiQr of 

U.S. P^nt No. 4,835,260 by Shoemaker, C. B. discusses modified 
Epo proteins m± amino add substitutions of die mediioiune at position 54 and 

30 aq>aragine at position 38. Sudi Epo muteins are diought to have inq)ibved 
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stability but are not proposed to exhibit any increase in biological activity 
relative to wild type Epo. 

Beyond the few proposed modifications that have been speculated lo 
enhance the biological activity of Epo, the art provides no further guidance 
5 regarding how Epo can be modified to increase its biological activity. This 
lack of guidance indicates that, based upoii Ae present state of knowledge,, 
other modifications of Epo which may increase biological activity camu)t be * 
predicted and may not even exist. 

The present invention represents the inventors' achievement in 
1 0 overcoming the lack of guidance and unpredictability regarding modifications 
of the Epo protein which increase its biological activity. . 

Summary Of The Invention 

It is tbdrefore an object of the present invention to provide Epo muteins 
with enhanced biological activiQf. 
IS It is a further object of the present invention to provide lecombiiiant 

DNA encoding Bpo mutdns and methods of producing biologically active Epo 
muteins in a host cell culture using such recombinant DNA, 

It is a further object of the present invention to proidde methods of 
tteatii^ blood disorders involviolg abnonnally low led blood cell populadons 
20 usiqg^ muteins at dosages lower than that required for unmodified ^k> to 
achieve Ae same affect. 

It is a further object of the present invention to provide a method of 
d>taint!^ additional Epo mutdns with enhanced biological acti 
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Brief Description Of The Drawings 

Figure L Oligonucleotide Primer Sequences. Oligonucleotide 
primers corresponding to Epo DNA sequences are shown. The localization 
of the primer coincides v^th previously published nucleodde sequences for the 
5 human (Lin. F.-K. et al.. Proc. Natl. Acad, Sci. USA 82:7580-7584 (1985)) 
and murine (McDonald, J.D. et al. , MoL & Cell Biol 6:842-848 (1986)) Epo ' 
genes. EX2R and EX5 are not completely conserved between human and 
mouse (respectively 93% and 95% identity). An equal amount of each 
possible nucleotide was incorporated during the corresponding cycles of those 

10 primer syntheses. EX2R and NCOl are reverse primers and their sequences 
represent the antisense DNA strand. 

Figure 2. PCR Strategy used for the cloning of mammalian 
cDNAs contaming the complete coding sequence of the mature 
Erythropoietin Protein. Genomic amplification and sequendng of exonic 

15 fragments, localized upstream and downstream ftom the nucleotide portion 
coding for the maniie protein, allowed the design of q)ecies ^)ecific primers 
(SP). Utilization of those SP primers and/or of/ sequences that are 100% 
conserved between man and mouse 5' ATG and 3' N(X)1 primers (Figure 1) 
on cDNA templates prepared from kidney of umnduced or hypoxia-induced 

20 aiumals* allows die amplification of a laige variety of mammalian Epo clmes. 
Sense (•^) and antisense (4-) primers are represented by the ano^ Dasbed 
boxes correspond to die coding part (propeptide and mature protdn) of the 
five exons. Gray boxes iq)resent die S' and 3' untranslated regions. 
F^^n J. Cmnparison of IV1/EX2R sequences from various 

25 mammals. The numbering oorre^nds to die published human genomic 
sequence (Un. F.*K. etd., Proc. Natl. Acad. Sd. USA £2:7580-7584 
(1985)). Tte rq)orted human isequenoe was obtained from die amplification 
of purified genomic DNA fiom Hq>3B cells and agreed widi die sequence 
preWously repcmed. The mouse sequence is from McDonald, J.D. et al., 

30 Mol. & CettBioL 6:842-848 (1986). The horse sequence was obtained fiom 
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kidney-extracted genomic DNA. All the other sequences were established 
from PCR amplification of genomic DNAs purified from several mammalian 
renal-derived cell lines. The consensus sequence indicates the positions where 
a unique nucleotide was found in all the reported ^ies/ The boundary 
5 between intron 1 and exon 2 is represented by the ascending arrows. The 
locations of the two PCR primers, I VI and EX2R are shown by the dashed . 
arrows. 

Figure 4. Alignment of the nucleotide sequences of mammalian 
E^o cDNAs. The mouse sequence corresponds to the one previously reported 

10 (McDonald, J.D. et cL, MoL & C€llBioL 6:842-848 (1986)). PCR-produced 
human sequence was obtained from hypoxia-induced Hep3B cell line and is in 
total agreement with the previously published sequence (Jacobs, K. etal.^ 
Nature 31 3:S06-m (1985); Lin. F.-K. Proc. NalL Acad. Sci. USA 
82:7580-7584 (1985)), Monkey (Macaca mulaita), rat, sheep, pig and cat 

15 . were amplified from kidney-purified cDNAs. Amplifications of the human, 
monkey, rat and sheep were realized using the ATG (residues 1 to 20) and 
NCOl (residues 723*742) primers. The line under residues 41 to 56 
rqiresents the specific SPl primer used for the cloning of the pig and cat 
sequences. For these two mammals, 5' sequences for SPl are derived from 

20 the data presented in Figure 3« Arrows indicate the start and the end of the 
codii% sequence (propeptide and mature piotem). 

FtgimS. Fleeted Amino Add Sequdices of Propeptide. 
The amino add sequences are derived from data obtained ftom bodi gemmiic 
IV1/EX2R and cDNA anq>lificadons. The nmnberii^ corresponds to the 

25 human sequence. Ala 1 is the first amino acid of die human mature protdn. 
The asoendiqg arrows show the site of the cleavage by die signal pqiddase, 
as determined for die human and cynomolgus montey protrins. 

Figure 6. AUgmnat of the Primaiy Structmnes of matmne 
marnmalum ^ Pk^tdns. The human, Macaca fiudculaiis and mouse 

30 amino acid sequences were previously rq)orted (lin, jP.-K. a a/. . Pnoc. Natl 
Acad. Sd. USA £2:7580-7584 (1985); lin, F.-K. eial.. Gene 44:201-209 
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(1986); McDonald, J.D. el al, MoL & Cell Biol. 6:842-848 (1986)). The 

residue numbers corre^nd to flie 166 aa of the mature human protein. Plain 

boxes indicate the positions of the predicted four a-helices in the human 

sequence (See Example 11). Dashed boxes show the N- and 0-glycosylauon 

5 sites. Limits between cxons arc indicated by the vertical lines. 

Figure 6A. Schematic representation of the mammal 

erythropoietm. The invariant amino acids among the eight sequences shown • 

in Figure 6 are represented by the black boxes. The localization of each of 

the four a-helices is underlined. The three glycosylation sites are shown as 

10 diamonds. The main disulfide bridge is indicated by the heavy line connecting 

noncontiguous sequences. The small arrows under tiie sequence indicate tlie 

short SH-bridge, missing in tiie rodents. 

F^ures 7-78. Phylogenetic lineages derived from analyses of 
mammalian Epo cDNA sequences encoding 
]5 the full length mature protdns from seven 

spedes. 

Figure 7. Strength of groups bi the maxhnnm par^ony tree 
found on exanuiimg all 945 unrooted trees formed by seven tenidnal taxa. 
This tree of lowest lengdi required 374 base substitutions. Each link between 

20 ancestral nodes has a circled number; tiiis strenigdi of gtoupiqg number is the 
mimmum number of substitutions tiiat must be added to die leqgdi of the 
maximum pardnxmy tree 10 find a tree diat breaks down die barrier (m^^ 
one or more sequences) between die two groups slanted by die interior link. 
Ptgtm 7 A. Hie phylogenetic tree derived from Hie maximimi 

2S paidmony reconstrucdcm. On die b^ of odier molecular eWdenoe 
involving comparative amino add sequence data fiom monotremes, 
marsupials, and many eutiierian q)ecies (Czelusniak, J, et al., Mefft. EnzymoL 
183:601-615 (1990); Czelusniak, h et al. In "Current Mammology\ 
Volume 2. 3rd ed,. H. H, Ganoways, pp. 545-572, Plenum Publ. Corp. 

30 (1990)) tiie root of tius Epo phylogenetic tree is placed on die interior link 
sq)arating the artiodactyl group from die primate, rodent, cat group. The 
numbers on die imemodal links rq>resent die numbers of base pairs by whidi 
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the nodal ancestral and descendant sequences differ. MYA = millions of 
years ago. as inferred fiom palcontological views of cuihcrian phylogcny. 

Figure 7B. Parsimony recwistructlon on that portion of the 
cDNA sequences that are codons for amino adds. The numbers shown as 
5 a fraction on each link are, in the numerator, the number of amino acid 
changing base replacements and. in the denominator, the number of silent base 
replacements. The computer algorithm that carried out this calculation is • 
described in Czelusniak. J. et al.. Nature 295:297-300 (1982) (see the legend 
for Figure 2). 

10 Figures MA. Phylogenetic lineages derived from analyses of 

the myn""**'"" Epo intron 1-exon 2 , 
sequences. 

Figure 8. Strength of groups in the maximum parsimony tree 
found on examining all 135,135 trees formed by nme terminal taxa. As 

15 the two feloid sequences (those of cat and lion) are nearly identical, they were 
treated as a single taxon in this strength of grouping analyas. The maximum 
panimony tree for the segment (about 285 bp) of intron 1 and exon 2 from ten 
spedes required 361 base substinitions. The circled numbers are the strengdi 
of giouping numbers (as defined in the Figure 7 legend). 

20 F^m* 8A. Hie near most pardmonious tree that groops dog 

irith feloid^ Tins tree required 365 base substitutions. The numbers on links 
lepiesent nuxnbeis of base pairs by whidi the nod^ and descftnda m 

seqtaenoes diffor. 

Pigum9-9B. Madd of the Thite-JXmmshnd Stmctun 
25 EiyAropoietin. 

Rgun9. lUbbondiagrwnofEpoUrtiaiystmetur^ Thefouro 
bdtces are labeled A to D; loops between helices aie q>proprialdy named. 
Disulfide bridges are shown and N- and 0-glyco^lation sites are indicaied 
re^)ectively by dark and dotted s^ments. 
30 Figure 9A. Sdiematic representation of ^*s priniary struccuie 

dq>icting piedicted up-up-down-down orientatira of the four antiparallel o 
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heliccs (boxes with arrowhead). This folding paKcrn is strongly suggested by 
the large size of the two interconnecting loq)S AB and CD. The limits of each 
helix were drawn according to Table 11 of Example I. A predicted short 
region of /3-sheet is delineated by the dashed rectangle. The N-glycosylatlon 
5 sites are represented by the dotted diamonds, the 0-glycosylation site by the 
dashed oval. The locations of the two disulfide bridges are shown as solid 
lines. 

Rgure 9B. Cmss-section of the Epo molecule at the level of the 
four a helices. The helical wheel projections are viewed from the NHj-cnd 

10 of each helix. The hydrophobic residues, localized inside the globular 
structure, are indicated by filled circles. The charged and neutral residues 
{aptn and gray circles respectively) are exposed at the surface of the molecule. 

FigunlO. Immunopredpitatlons of wild type Epo and the 
A140-144 mutein. Cos? cells were transfected with pSG5. pSG5-Epo/wt or 

IS pSG5-^A140-144. After fluee days, die were metaboUcally labeled 
vin& pS]-methiomne and p^Js^slwne. Immunppiedpitations of cellular 
extracts and supematants were performed with our polyclonal anu'body. msed 
in rabbit against die native human The immunppredpitates were 
analyzed by SDS-polyaciylamide gel electiophoiea& Lanes 2 to 4 correspond 

20 tocdlularextncts, and lanes S 10 7 ooire^ond 10 culture supernatants, from 
Co^ tiansfonned mdi die Mowiiig: plasmid ii^diott insot (1^ 
wild type Qpo (lanes 3 and and A140-144 (lanes 4 and 7). Lane 1 
i^iesents die protdn motecular wnght slandaid. The two arrows show die 
normal secretion of die wild type Epo (35-37 kD) and die cytoplasmic 

25 retention of die mntein A140b 144 (-28 kD). 
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Figuns IJ-IIB. Interconnecting Ixiop AB. 
Ftgun 11. Schematic representatimi of the loop AB showuig the 
localization of mutebis with various deletions and ambio add 
replacements. The dashed arrows point to the positions of the serine 

5 substituuons (in A48-52). The two N-glycosylation sites are represeiited by 
the gray diamonds. The small Cys29=Cys33 disulfide bridge is indicated. 

Figure llA. Amount and biological activities of secreted mutdns. 
The upper bar gr^hs show the relative secretion of wild type and loop AB 
muteins as determined by radioimmunoassay. The lowest bar graphs display 

10 the calculated specific activity (ratio bioassay/RlA) for each mutein, in 
comparison with the value obtained for the wild type ^o (ratio = 100%). 

Figure IIB. HCD57 cell proliferation as a function of uicreasing 
concentration of wild type and serine-substituted Epo muteins. HCD57 
cells (10* per ml) were cultured for tiiiee days in a % well microtiter plate 

IS with media contuning increaslog oonoentntions of secreted proteins 
(mU -milli^nits of a relative mass measurement). The line graphs show 
the cdlular giowft as measured by 'H-thynudinc tq>take for cells cultured with 
wild type ^ (O), ^ mutein F48S (O), Epo mutein Y49S (X). Epo mutein 
A50S (♦), Epo muteiiiW51S(D), and Epo mutein K52S (a). The number 

20 of viabteodlswofi also ineasured with the Mitiraloiimetric assay ai^ 
similar cun«s. PioiiiiBiatitm experiments usii« the 
(Komatsa. r\.,etaL, Cancer Res. 5i:341-348 (1991)) and the Kiysial assay 
(K^. G.. Exp. Henwtd. J/:649-«) (1983)) produced idendcd results. 
Figans 12-12A. Interconnecting Loop CD. 

25 Ftgun 12. Sdiematki^presentatkmortlieloopCDsliowliigtiie 

locationorthreedeietionmnteins: A105-109,Alll-119,A12M26,andtlie 
insertionorsevenresiduesaflerL]isll6(mycepttope). TheO-glycbsyiation 
site is indicated by the dashed oval. 

Hgure 12A. Secr«tion and biological adhitics of the mutdns 

30 located in loop CD. The two bar graphs were created as described in 
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Rgiire 1 1 A. The two mutants Al 1 1-1 19 and 1 16/myc were normally secreted 
and had full biological activities. 

Figures IS ISA. In Vitro Translation of the Epo WHd 1^. 

Figure 13. Analysis of the ^'S-laI)clcd translation products by 
5 SDS-PAGE. One-step transcription/translation reactions were performed in 
the SP6-TnT rabbit reticulocyte lysaie system. 1/30 of each reaction was 
resolved on a 15% polyacrylamide gel. i^ne Mow Mr standard from 
Amersham; lane l-in vitro reaction without added plasmid; lanes 3 and 4 are 
translation products obtained after incubation of 1 /tg of circular p64T-Epo, 
10 in the presence or absence of canine pancreatic microsomal membranes, 
req)ecdvely. 

Figure 13A. Binding of the in vitro translated Epo wild type onto 
Epo Receptor-GTS-agarosc beads. 6x10* cpm (counts per minute) of 
purified ^*S-labeled erythropoietin products, processed with microsomes (+) 

IS or not (-) were incubated in the presence of the extra-cytoplasmic domain of 
the Bpo receptor (EREJ, foUomqg the protocol described by Harris, K.W., 
etal.. J. mol. Chm. 267:15205-15209 (1992). Identical binding 
demonstrated that the conservation of the propeptide did not impair the 
hormone/receptor interaction. 

20 Figures 14^14Jl aXIH-end of E^. 

» FigunU. Sdieinatie lepresentation of the analyied 
oo^re^onduig to tiie deletion of dte four last anuno acids iil63-166 and tiie 
iq)lacenients of the residues 162 to 166 by a KDEL or poly (His) sequences. 
FtgmUA. Relative secretion of diese niotdiis. Thebioactivtties 

25 in die siq>ematant$ (Krystal, C, Exp. HematoL 77:649-60 (1983)) and the cell 
extracts (Komatsu, N.,et al.. Cancer Res. 57:341-348 (1991)) of transformed 
Q>s7 cells were measured by in vitro prolifcFation assay usiqg HCD57. More 
KDEL mutant remained in tiie cytosol of die C^os?, when conq>afed widi tiie 
wild type Epo and A163-166 or poly (His) muteins. However, all tiie 

30 analyzed muteins had tiie same q>ecific activity as that of tiie wild type. 
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Figure 15. Bacterial Expression of Wild Type Epo. 

Figure 15 Panel A. Diagram of the fusion protein. An NHj- 
terminal 22 amino acid long peptide, containing a 10 histidine stretch, was 
fused to the mature erythropoietin sequence. Factor Xa cleavage allowed the 
5 recovery of the mature Epo with only two extra residues at its amino terminus. 

Figure 15 Panel B. IFTG induction of the fusion protein. 
Transformed E. coli BL21 (DE3) cultures (ODaoo==0.6) were grown in* 
presence of 1 mM IPTG. Aliquots were collected at 0 (lane 2), 1 (lane 3), 2 
(lane 4) and 3 hours (lane 5) and analyzed on a 15 % SDS-polyacrylamidc gel, 
10 stained with (i)omassie Brilliant Blue. High level production of the fusion 
protein was rapidly obtained. Lane 6 corresponds to an aliquot from 
transformed bacteria giown for 3 hours in a medium without IPTG. Lane 1 
is a low molecular weight standard. 

Figure 15 Ptod C. PuriTication of the fusion protein. After 3 
15 houra of IPTG induction, the produced (His)io-^ was solubilized in 6 M 
guanidine-HCi and purified on a niclcel aflini^ rean by increasing imidazole 
concentrations following die pET-His system protocol (Novagen). Samples of 
die column eiuants were analyzed by SDS-PAGE and stained with Coomasde 
Brilliant Blue. Lane 1-elution by 20 mM imidazole; lane 2-elution by 100 
20 mM imidazole, releasing die fusion protein; lane 3-dielation of the nidcel by 
a 100 mM EDTA wash; lane 4-molecular wdg^ standaid. 

Figure 15 PandD« Detection of the ELcoUreccmibinantE^ ma 
Western blot. Solubilized proteins were 8q>arated on a 15% SDS- 
polyaciylanude gel, transferred to a nitrocellulose membrane and probed widi 
25 a 1/2000 dilution of our native wild type polyclonal antibody, as described 
under Materials and Methods. Lane l*analysis after oxidative reduction; 
lane 2-after dialysis against die factor Xa buffer, and lane 3-after factor Xa 
cleavage* 

Figure 16. Relationship Between Production of Muteins and 
30 Proposed Secondaiy Structare. This bar grq>h shows die amount of 
secreted proteins in die supematants of transiemly expressed Epo mutants, as 
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detected by radioimmunoassay. The mutcins were aligned over a schematic 
representation of the native Epo molecule. Each deletion is shown as a 
stippled bar. the width of which is proportional to tiie number of residues 
deleted. The four a helices are represented by the black rectangles. The two 
disulfide bridges are indicated. These mutagenesis results are in good 
agreement with our proposed four a-hclical model of Epo. 

Figure 17. Biological Activity of Helix A Muteins. The top panel 
shows a helical wheel projection of Epo helix A with arrows indicating the 
positions where modifications were made to create muteins. The lower panel 
is a dose response curve represeming die biological activity of wild type Epo 
(0) and helix A muteins B13A (♦), R14A (0), E18A (O). K20A {■*■), and 
E21 A (o) over a range of dosages (X-axis; mU=miIli-units of Epo, a relative 
mass measurement) as determined by the relative incorporation of pH)- 
thymidine measured in counts per minute (cpm) (Y-axis) into Epo dependent 

15 HCD57CCU8. 

F^n 18. Biological Activity of Helix D MuUbis. ,The top panel 
shows a helical wheel projection of Epo hdix D with arrows indicating the 
positions where modifications were made to create muteins, TTie lower panel 
. is a dose response curve representing Ae biologlcai activity over a range of 

20 dosages (X-axis) of wild type Epo (Q) and helix D muieins D136A (♦). 
R139A (TO, K140A (O) and 143A (x) as determined »>y Ae relative 
incorpotaiion of pHphyMdiiie (Y43ds) into Epo depemto HCD57 cells. 
Figm 19, Biological Activity oTMoteins A4iacent to HeDx D. 

The top panel shows the region adjacent to the C-tcnninal boundary of Epo 
25 helix D widi arrows indicating die positions where modifications were made 
to create muteins. The lower panel is a dose response curve representing the 
biological activity over a range of dosages (X-axis) of wild type Epo (13) and 
muteimi adjacent to helix D K152A (♦). K154A (B). and Y156A (O) as 
determined by die relative incorporation of pH]-tiiymidine (Y-axis) into Epo 
30 dependent HCD57 cells. 
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Figure 20. Comparison of Muteins With Increased Biological 
Activity. A dose response curve representing the biological activity of wild 
type Epo (-*•), and muteins R143A (□) and K154A (♦) with increased 
biological activity relative to wild type Epo is shown over a range of dosages 
5 (X-axis) as determined by the relative incorporation of pH)-thymidine (Y -axis) 
into Epo dependent HCDS7 cells, h. 

Figure 21. Comparison of Muteins With Increased Biological* 
Activity in Non-malignant Cells. A dose re^nse curve representing the 
biological activity of wild type Epo (-*-), and muteins R143A (□) and K154A 

10 ( ♦ ) with increased biological activity relative to wild type Epo, is shown over 
a range of dosages (X-axis) as determined by the relative incorporation of 
pH]-thymidine (Y-axis) into murine spleen cells. 

Figure 22. Comparison of Muteins With Increased Biological 
Activity in a Human Cell Une. A dose response curve representing the 

IS biological activity of wild type Epo (x), and mutems R143A (□), and K1S4A 
(♦) with increased biological activity (relative to wild type Epo) is shown 
over a range of dosages (X-axis) as determii^ by the relative incorporation 
of pH]-thymidine (Y-axis) into human Epo-dq)cndent UT-T/Epo cell line 
. (Komatsu, N. et d.. Cancer Res, 51: 341-348 (1991). 

20 Figure 23. ComparistHi of Mutebis with Increased Biological 

Acti^ in a Human CeD Une. A dose response curve rq>tesentii% die 
biological activity of wild-type f^o (Q) and mutein N147A ( ♦ ) widi increased 
biological activity (idati ve to wild-^ EjX)) is shown wex a laiige oif dosages 
(mU) (X-axis) as detemuned by the lelative inoorporadon of pH]-thymidine 

25 (Y-axis) into human ^x>^q)endentirr-7/E^ cell line. 

Figure 24. Comparison of Mutdns with Increased Biolciglcal 
Activity in a Human CeD Line« A dose response curve representiiig the 
biological acdvity of wild-type Epo (Q) and mutein N147A ( ♦ ) vath iiK^reased 
biological acdvity (relative to wild-type ^) is shown over a range of dosages 
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(mlJ) (X-axis) as determined by the relative incoiporation of pHl-thymidine 
(Y-axis) into human Ep(Hlependent UT-7/Epo cell line. 

Figure 25. Comparison of Muteins wHh Increased Biological 
Activity in a Human CeH Line. A dose response curve representing the 

5 biological activity of wild-type Epo (□) and mutein N147A (♦ .0) with 
increased biological activity (relative to wild-type Epo) is shown over a range 
of dosages (mU) (X^ucis) as determined by the relative incoiporation of [»H]- • 
thymidine (Y-axis) into human Epo^cpcndent UT-7/Epo cell line. 
Biological Activities of Helix A Muteins. 

10 Figun 26. Comparison of Miitems with Increased Biological 

Activity in a Human CeU line. A dose response curve representing the 
biological activity of wild-type Epo (b) and mutein K20A witfi increased 
biological activity relative to the wild type Epo, is shown over a range of 
dosages (X-axis) as determined by tiie relative incorporation of pHJ-Uiymidine 

15 (Y^s) into UT-7 1^ cells. 

Ftgun 27. Comparison of Muteins with Increased Biological 
Activity fa a Human CeH line. A dose response curve representing the 
biological activity of wild^ype Epo and mutein Y49S with increased 
biological activliy relative to the wiM type Epo. is shown over a range of 

20 dosages(X-axis)asdcternuiiedbytheietotiveinooiporati6nofpH>thyim 

(Y-axis) into UT-7 Epo oelis. 

Biological Activities of Hdbc B Mutefais. 

FIgun 28. Cominrisoa of Mutefas with Increased Biological 

Activity fa a Human CeD Lhie. A dose lesponse curve lepteseniing the 
25 biological activity of wild-type Epo (0) and mutem A73G widi increased 

biological activity relative to the wild type Epo, b shown over a range of 

dosages (X-axis) as determined by the relative incorporation of pHhdiymidine 

(Y-axis) into UT-7 Epo cells. 

Biological Activities of HeUx D Mutefas. 
30 Figure 29. Comparison of Mutefas with Increased Biological 

Activity fa a Human CeD Une. A dose response curve representing the 
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biological activity of wild-type Epo (0) and mutein Ki40A with increased 
biological activity relative to the wild typt Epo, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thynudiije 
(Y-axis) into UT-7 ^ cells. 
5 Figure 30a. Comparison of Mutelns with Increased Biological 

Activity in a Human CeU line. A dose re^nse curve rq)resenung the 
biological activity of wild-type Epo (Q) and mutein R143A widi increased • 
biological activity relative to the wild type 1^, is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pHJ-thymidine 

10 (Y-axis) into UT-7 Epo cells. 

F^n 30b, Comparison of Muteins with Increased Biological 

Activity in a Mouse Cell Une. A dose response curve representing the 

biological activity of wild-type Epo (Q) and mutein R143A with increased 

biological activity relative to the wild type Epo, is shown over a range of 
15 dosages Ql-aas) as determined by tte relative incotpoiation of pH>diyaudine 

(Y-axis) into HCD 57 cells. 

Figun 31, Comparison of Mutdns with Increased Biolo^cal 

Activity in a Human Cdl Une. A dose le^nse dirye rqiresenting the 

biological activi^ of wild-type Epo (□) and mutein S14dA with increased 
20 biological activity relative to the mid ^ is shown over a lange of 

dosages (X-«ils) as determined by the relative inooipoiation of pH]-thynudine 

(Y-axis) into UT-7 ^ cdls. 

Pigiut 32. Comparison of Mutdns wifli Increased Bidiogjkal 

Adtvity in a Human Cdl Lfae. A dose re^nse carve rq>resenting d>e 
25 biological activity of wild-type Epo (B) and motein N147A with increased 

biological activity relative to die wild ^pe Epo, is shown over a taitge of 

dosages (X-axis) as determined by d>e rdative incorporation of pIQ-diynUdine 

(Y-axis) into UT-7 ^ cdls. 

figure 33a. Comparison of Mutdns with Increased Bidlo^cal 
30 Activity in a Human CeD Une. A dose re^nse curve rq)resenting the 

bido^cal activity of vwld-type Bpo (b) and mutein K154A witit increased 
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biological activity relative lo the wild type is shown over a range of 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into UT-7 Epo cells. 

Figure 33b. Comparison of Muteins with Increased Biological 

5 AcliWty in a Mouse Cell Line. A dose response curve representing the 
biological activity of wild-type Epo (□) and mutcin K154A mih increased 
biological activity relative to die wild type Epo, is shown over a range of • 
dosages (X-axis) as determined by the relative incorporation of pH]-thymidine 
(Y-axis) into HCD 57 cells. 

10 Figure 33c. Comparison of Muteins with Increased Biological 

Activity in Non-Malignant Mouse Cells. A dose response curve representing 
the biological activity of wild-type Epo (□) and mutein K154A with increased 
biological activity relative to tiie wild type Epo, is shown over a range of 
dosages (X-axis) as determined by die relative incorporation of pHl-diymidine 

IS (Y-axis) into primary murine ^leen cells. 

Figure 34. Bioactivity of Epo Replacement Muteins. Hie letters in 
die column on die left designate predicted helices (A, B. C. D) or interhelical 
loops (A-B, C-D). The 3 'D designation (3' of D) in die table represents 
amino acids 153-166 which are at die carboxy-end of die D-helix. The amino 

20 add of wild type human Epo, its position from die N-te.rminus, and its 
replacement according to die angle amino acid letter code, are listed under die 
mutein column, e.g. S9A, represents a mutation from die wild-type human 
Epo at die amino add serine of position number 9 to alamne. The three 
bioassay columns (human UT7 cell line, murine HCDS7 cell line, and primary 

25 mouse spleen eryduoid cdls) show spedfic bioactiidly of each mutdn, 
e3q>ressed as a percentage of wild type human bioacdvity, widi die 
background COS supematam alone subtracted from die value. The mean and 
standard deviation of die bioassay values are listed for n^3 determinations. 
The numbers mdiin die brackets indicate die number of sqsarate COS cell 

30 transfecdons over (/) die number of sqmraie bioassays diat were performed. 
NS: not seoeted from COS 7 cells. 
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Dejinitiotts 

In order to provide a clearer and more consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
following definitions are provided. 

Administration. The term "administration" is meant to include 
introduction of ti»e Epo muteins of tiie invention into an animal or human by 
any ^propriatc means known to the medical or veterinary art, including, but 
not limited to, injection, oral, enteral, and parenteral (e.g., intravenous) 
administration. 

Amino Add Codes, The most common amino acids and tiieir codes are 
described in the following table: 
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TaWel 
Amino add names and codes 



Amino acid 


Single letter 


Three letter 


code 


code 


Alanine 


A 


Ala 


Arginine 


R 


Arg 


Aspanic acid 


D 


Asp 


Asparagine 


N 


Asn 


Cysteine 


C 


Cys 


Glutamic acid 


E 


Glu 


Glutamine 


Q 


Gin 


Glycine 


G 


Gly 


Histidtne 


H 


His 


Isoleucine 


1 


He 


Leucine 


L 


Leu 


Lysine 


K 


Lys 


Methionine 


M 


Met 


Phenylalamne 


F 


Pbe 


Proline 


P 


Pro 


Serine 


S 


Ser 


Threonine 


T 


Thr 


Tryptophane 


W 


Tip 


Tyrosine 


Y 


Tyr 


Valine 


V 


Val 



Animal. The term "animal" is meant to include all aiumak whose 
25 production of red blood celk (erythrocytes) is dependent upon, or stimulated 
by. erythropoietin (Epo). Foremost among such animals are humans; 
however, die invention is not intended to be so limiting, it being withm the 
contemplation of die present invention to treat any and all animals which may 
expeiience the beneficial effect of die Epo muieins of die invention. 
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Efficacious Amount. An "efficacious amount" of an Epomuiein of the 
invention is an amount of such Epo mutcin that is sufficient to bring about a 
desired result, particularly the stimulation of led blood cell production, 
especially upon administration to an animal or human. 
5 £iyih„,c>tt.Theterm"erythrocyte-isintendcdtoiefcrtoaredblood 

cell. 

E^nid Pruunor CeUs. By "erythroid precursor cells" is intended 
cells with a capability to proliferate and differentiate into erythrocytes that is 

dependent upon exposure or contart with Epo. 

10 Efylhnpoietin (Epo). The tcm "erythropoietin", or "Epo" is 

primarily intended to refer to the mature form of this hormone. The terms 
-wild type Epo" and "native Epo" are used interchangeably to refer to Epo in 
its naturally occurring, umnodified form. Unless otherwise indicated, amino 
acid positions on tt>e Epo protein referred to herein ate made with lefeience 

15 10 the mature. 166 amino acid human Bfo sequence shown in Rguie 6 and 
corre^nfing sequences from other speas&. 

Host Cdl. By "host" or "host cell" is intended the cell in which a gene 
encoding an Epo mutein of ti>e invention is incorporated and expressed. An 

Epo mutein gene of tiie invemion may be imroduced imo a host cell as part 

20 of a vector by transfomaripn. 

. MhIob. By -muteinMsmeam a miitaw or modified protein with one 

or mote modifications to its native amino acid sequence in the ftmn of amiro 
acid additions, deletions, or substitutions. An erydjiopoietin mutein lefos to 
a modified erytiirqwietin protdn. 
25 PhannaceutieaUy AeuptabU VehicU. The term "phatmaceuticany 

acceptable vehicle" is intended to include solvents, carrien. diluents, and the 
Uke. which are utilized as additives to preparations of tiie Epo muteins of the 

imnmtion so as to provide a carrier or adjuvant for die administration of such 
Epo mutdns. 

30 Tnn^oma&on. By "transformation" is intended tiie act of causing a 

host cell to contain adesired mideic acid molecule, including either a native 
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Epo gene or a gene encoding an Epo mutein of the invention, not originally 
part of that cell using methods known in the art. 

Tranrfecdon. By "iransfection" is intended the introduction of a DNA 
or RNA vector carrying a desired nucleic acid molecule, including either a 
5 native Epo gene or a gene encoding an Epo mutein of the invention, into a 
host cell. 

Vector. By "vector" is intended a DNA element used as a vehicle for • 
cloning or expressing a desired gene, such as an Epo mutein gene of the 
invention, in a host 

10 Detailed Description of the Preferred Embodiments 

This invention is drawn to modifications of Epo which enhance its 
biological activity. Due to dieir enhanced activity, the Epo muteins of the 
invention provide a preferable alternative to ntive Epo where it is used to 
stimulate red blood cell production (e.g. treatment of blood disorders, cell 

IS culture, etc.). 

The present invention teaches a ^ctural model of die Epo protein 
useful for identifying functional r^ipns which may be modified to enhance 
biological activity. Aocordipg to die invention; ^ is prediciBd tt> have a 
four anti-patallel amphipaduc a-4ielical bundle structure. Based q)on Ais 

20 predicted structure, functionally important regions predicted to reside on die 
external sur&ces of die helices (designated A-D, see Figures 9A*9B) are 
identified. In particular, die external surfaces of hdices A and D, predicted 
to be involved in Bpo receptor bindiqg, are identified. Aooordiqg to die 
invention, diese functionally important r^ions serve as targets for 

25 modifications which may enhance biological activi^. Im^reased activity 
relative to die wild-iype Epo can be found widi amino add substitutions in die 
A,B,C or D helix. 

According to one zspecx of die invention, Epo muteins are provided in 
vAiich one or more of die amino acids mdiin the predioed external surfaces 
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of helices A. B, C. D and 3' of D, or within the region immediately adjacent 
to the C-terminal end of helix D, have been replaced. In particular, amino 
acid substiiutions at positions 20. 49. 73. 140. 143. 146. 147 and 154 of wild 
type Epo have been replaced. The inventors have discovered that subsUtuUon 

5 of the amino acids normally occurring at these positions results in Epo muteins 
with significantly higher biological activity than wild type Epo proteins. 

According to the invention, the amino acids at position 20 (typically • 
alanine). 49 (typically serine). 73 (typically glycine). 140 (typically lysine), 
position 143 (typically aiginine), position 146 (typically serine), position 147 

10 (typically asparaginc), and 154 (typically cither lysine or threonine) of wild 
type Epo are preferably replaced with an alanine residue. In addition to 
alanine other amino acids with a chemical structure and properties similar to 
alanine, such as serine and Uireonine. arc contemplated by the invention as 
suitable subsiinites for achieving Epo muteins of Uie invention witii enhanced 

15 biological activity. 

In anodier aspect of the invention, a portion of ftc predicted interloq) 
region between helices A and B extending fiom amino acid 48-52 is identified 
as important to Epo fiinction. According to die invention, fliis functionally 
important region serves as anoflier target for modifications which may enhance 

20 biological activity. The iiiventois have discovered that substitution of the 
amino add at position 49 results in Bpo muteins with signlficaittly higher 
biological activity dan wild type Epo prolans. 

' According to die invention, the amino acid at position 49 (typically 
tyrosine) of wild type Epo is prefsrably replaced with a serine residue. Other 

25 amino ai^ wrtdi a dientical stnicture and properties similar to soine, such 
as alanine and tiueoiune, are contemplated as suitable substitutes for adiieving 
1^ muteins of die invention witfi enhanced btolt^al activity. 

The modifications taught by tiie presem invention occur in «&oos diat 
are highly conserved among Bpo proteins from evolutionarily divergem species 

30 (see Example 1 and Figure 6 in particular). These modifications are thus 
expected to be broadly applicable to Epo protdns which are substantially 
similar to die human Epo protein in die regions where tije modifications occur. 
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including, but not limited to. Epo proteins from monkey, mouse, rat, sheep. 

pig, cat, and dog. 

Biological activity of Epo mutelns can be determined by assaying the 

ability of these proteins to stimulate the proliferation of Epo responsive cells 
5 such as murine spleen cells (Krystal, G.. Exp. Hmatol. ll'.M9-60 (1983); 

Goldberg, M.A.. et cL. Proc. Natl. Acad. Sci. USA W:7972.7976 (1987)). 

murine Epo responsive murine crythroleukemia cells (Hankins, W.D.. ei al.. ■ 

Blood 70:173a (1987)). and human Epo^ependent UT-7/Epo cells (Komatsu, 

N.. et al.. Cancer Res. 57:341-348 (1991)). 
10 When assaying for biological activity, it is important to recognize that, 

as taught by the present invention, high levels of biologically active Epo can 

cause terminal maturation and cessation of division of Epo responsi ve cells. 

Because of this possibly confounding phenomenon, it is important to determine 

biological activity based on a fuH dose response curve over a broad range of 
15 dosages whidi includes non-sawrating levels of the particular Epo mutein 

being assayed. 

Although die mechanism by which the nmdificatiohs taught by the 
invention increase biological activity is not completely understood, one 
possibility is that these modifications enhance receptor binding affinity. This 
20 possiViliiylsihoHghtioapplyparticulariyiothemodlficatfonsatpositioiB 

49. 73, 140. 143. 146. 147. sod 154. which lie wUhin a wglon predicted to 
be involved in Epo receptor bindiiig accordii« to the teachings of Ihc present 

inventioiu 

Genes Encoding Epo muteins ^ mutein genes) 

25 In addition to die Epo muteins tiiemselves, genes encoding these 

muteins are also contemplated by the present invention. Genes encoding the 
Epo muteins of the invention may be made by modification of the native ^ 
gene using standard mediods well known to those of skill in tiie art 

The native Epo geat may be obtained using a varieQr of standard 

30 techniques. For example, a DNA molecule correspondii« to the known 
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sequence of the Epo gene from a number of species may be artificially 
synthesized (See Example 1 and Figure 4). The Epo gene may also be isolated 
from a cDN A or genomic library using nucleic acid probes based on available 
Epo DNA sequence information and standard hybridization techniques as 
5 taught in U.S. Pat. No. 4.703.008. issued Oct. 27. 1987. which is herein 
incorporated by reference (Sec also Jacobs et al.. Nature 313: 806-810 
(1985)). Alternatively, amplificaUon of Epo gene sequences by polymerase • 
chain reaction (PGR) may be accomplished using primers based on available 

sequence data as ought in Example I. 
10 Once obtained, the wild type Epo coding sequence can be altered to 

encode an Epo mutein of the invention using standard oligonucleoude-directed 
in vitro mutagenesis techniques (See Zoller. M. and Smith. M., Methods in 
Emymlogy 100: 468-500 (1983); Dcnte. L. etal.. Nucl. Adds Res. 11: 1645 
(1983): Kunkd. T.A.. a/.. Me/A. £«y/no/. 754:367-382 (1987): Vande^^^ 

15 M. et al.. Gene 65: 129 (1988)). These techniques generally involve 
hybridization of an oligonucleotide carrying the desired alteration to a single 
stranded target DNA. This Is foHowed by complementary strand synthesis of 
the target DNA which incorporates the oligonucleotide canylng the desired 
alteration. The complementary strand containing the desired alteration is then 

20 propagated in a bacterial host cell. Specific appUcation of tiiis type of 
technique to inodify die Epo gene U described in Examples n and in and in 

Australian Patent Publication No. Al)-A-59145/90 by Fibi et al. and in U.S. 
Pat^ No. 4.835.260 by Shoemaker, both of which are herein incorporated by 
refereitte. 

25 Expression of Epo mutein Genes 

Genes encoding Epo muteins of die invention can be introduced and 
expressed in a selected host cell system using conventional materials and 
techniques. DNA elements such as promoters, enhancers, polyadenylation 
sites, transcription termination signals, and ti»c like should be associated witii 
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the Epo mutein coding sequence so as to promote and contiol expression of 
the Epo mutein. TTie specific regulatory elements used will depend upon the 
host cell system selected for expression, whether secretion of the protein is 
desired, and other considerations readily apparent to one of skill in the art. 

Various vectors may be employed as vehicles for the introduction and 
expression of Epo mutein genes in a host cell. Such vectors useful in the 
differem host cell types are well known and include, for example, the- 
„ammalianexpre^ionvectorspSG5(Straugene).p-RKl(Geneticslnstitute). 

P-SVK3 (Pharmacia). p-EUK-Cl (Clontech). pCDM (Invitrogen), pc DNAI 
(invitrogen). and the bacterial expression vectors pFLAG-1 OBI), all pET 
system pbsmids (Novagen). pTrcHis (Invitrogen). die pGEX series 
(Pharmacia), and pKK 233-2 (Clontech). These vectors may be maintained 
as episomes in die host cell or tiiey may facilitate integration of die Epo 
mutein gene into the host cell genome, or both. Vectors may also include 
oti«r useful features, such as genes which allow for Uie selection or detecuon 
of cells in which they have been successfully introduced. 

Host cells suitable for expression of die Epo muteins of die invention 
i«dude. but are not necessarily limited to. E. coU, yeast, insect cells, plant 
belU. and a variety of mammalian cell types, including in particular Oa^ 
20 Hamster Ovary (OTO) cdb. Cos? cdb. Cosl cells, baby hamster k^^^^ 
cell?, and CVl cells. ^ mutein genes can be introduced into a host cell 

usii« standard transformation or transfection techiuques. 
• HostcdlswWchptovideforglycosylationarepiefcnedforproduction 

of Epo muteins to be used to Ww since the caibohydiate stroctwe of Epo b 
25 important for optimal biologfcal acdvhy in vfw (See Doidal. M.S. et al.. 
fiuJocnwtogy 2293-2299 (1985)). 

The Epo muteins of the invention may be iecovered and purified from 

host cells In which tiicy an^ expressed using known methods such as 
immunoaffinity chromatography widt antibodies to human Epo. 



15 
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Thempeutic Use of Epo muteim 
of the Invention 

The Epo muieins of the invention arc suitable for therapeutic use in . 
animals, particularly humans, and may be used in the same manner as wild 
5 type Epo. except that lower dosages will be required to achieve the same level 

of biological activity. 

Administration of the Epo muteins of the invention may be 
accomplished by any of the methods known to the skilled artisan, provided 
that the method effectively places the Epo mutein in the appropriate 
10 environment to exhibit its activity (i.e. in contact with erythtoid precursor 
cells). A pitferred method is by parenteral routes, including intravenous and 
subcutaneous administration. 

Administration will ordinarily include an efficacious amoum of the Epo 
mutein supplied in a pharmaceutically acceptable vehicle. The amoum of Epo 
15 mutein «*ich Is efficacious wlu be determined by a trained professional 
dependmg upon a mimber of corisidcntions. including Ae condition being 
treated and its severity, tiuj sex and body weight of the subject being treated, 
tiie meUwd of administration, and the presence Of compositions in tiie 
phaimaceuticaUy acceptable vehicle which affect Epo activliy. 
20 ; In any event, the efficaclotis amoum of a ^mweiii as taught Iqr the 
invention will be sigitificanfly less tiian tiie conesponding amoiral of 
unmodified Epo needed to achieve die same degree of efficacy. The 
difference between the efficacious amount of the Epo mutein and the 
analogous amount of uranodlfied Epo wlU conespond wifli die Increased 
25 biological activity exhibited by tiie Epo mutein. It is contemplated that an 
efficacious amoum of an Epo mutein of die invemion will be as low as ICfold 
less than native Epo. or as low as about 5-10 unitsflcg body weight for treating 
chronic renal failure patients or about 10-30 unitsAcg body weight for treating 
HIV-infected patients witii serum Epo levels less flum or equal to 500 mU/ml 
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(miUiunits/miliniter) who are receiving a dose of AZT equal to. or less than. 
4200 mg/week. 

Since the Epo muteins of the invention can be used effectively at lower 
dosages relative to wild type Epo. their use may reduce potential adverts 
5 effects associated with administration of Epo such as exacerbation of 
hypertension, seizuies. headaches, tachycardia, nausea, clotted vascular 
access, shortness of breath, hyperkalemia, and diarrhea (see PDR. 1993 

edition, at pp. 603-604). 

Further description of methods and materials useful in die recombinant 
10 production of die Epo muteins of the invention and Uieir use is provided in the 
teachings of U.S. Patem No! 4,835.260 by Shoemaker and U.S. Patem 
No. 4.703.008 by Un. both of which are herein incorporated by reference in 
their entireQr. 

The invention may be better understood and appreciated by reference 
15 to the following examples. These examples aie provided merely for 
illustrative purposes and should not be considered in any way as limiting the 
scope of the clttiined invention. 

EXAMPLE I 

EtythmpoUdn Stnutun-Fluulbn ROaiiotaUpK 
20 High Degneef Sequence UomhgyAnmgMonmub 

Summary 

In Older 10 invcstigaic stnicniie-fimction relationships of oyd»oi» 
(Epo). we have obtained (©NA sequences that encode the mature Epo pi^^ 
of a variety of mammals. A first set of primers, corresponding to conserved 
25 mideotide sequences between mouse and human DN As. allowed us to ampUfy 
by PCR intron 1-cxon 2 ftagments from genomic DNA of hamster, cat. lion, 
dog, horse, sheep, dolphin and pig. Sequencing of these fiagmems peimitted 
die design of a second generation of species-specific primers. RNA was 
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prepared from anemic kidneys and reverse-transcribed. Using our battery of 
species-specific 5' primers, we were able to successfully PCR amplify Epo 
cDNA from Rhesus monkey, rat. sheep, cat and pig. Deduced amino acid 
sequences of mature Epo proteins from these animals, in combination with 
5 known sequences for human. Cynomolgus monkey and mouse, showed a high 

degree of homology, which explains the biological and immunological cross 
reactivity that has been observed in a number- of species. Human Epo is 91 % • 
identical to monkey Epo. 85% to cat Epo and 80-82% to pig. sheep, mouse 

and rat Epos. There was full conservation of a) the disulfide bridge linking 
10 the NH, and COOH termini; b) N-glycosylauon sites; and c) predicted 
amphipathlc a-helices. In contrast, the short disulfide bridge (C29/C33 in 
human) is not invariant. Cys33 was replaced by a Pro in rodenu. Most of 
the amino acid replacements were conservative. The C-tcminal part of the 
loop between the C and D helices showed die most variation, witii several 
15 amino acid substitutions, deletions and/or insertJons. Calculations of 
maximum parsimony for intron 1/exon 2 sequences as well as codiiig 
sequences enabled tiie construction of cladogiams titat are in good agreement 
wid» known phylogcnetic relationships. 

INTRODUCnON ^ 

20 Alaigenumberofeariyphysiologicstudieshavecstablishedextensive, 
titoligh not complete, biological cross-reactivity between Epos of man and a 
number of other mammals including mouse. lat, sheep and labbit (Jdkmann. 
W.. n^ol. Rev. 72M9 (1992): Garcia. J.F. et al., Proc. Soc. Exp. Biol. 
Med. lJ2:m-m (1962); Zanjani. E.D. etal., Proc. Sac. Esqt. Biol 

25 tied. J3J:1095-1098 (1969); Biljanovic-Paunovic. L. etal.. Period. Biol. 
90:421-427 (1988)). In contrast, non-mammalian vertebrates (amphibians 
(Rosse, W.F., et al.. Blood 22:6i-12 (1963): Finder, A. et oL, J. Exp. Biol. 
/Qy:2Q5-213 (1983)). birds (Rosse. W. et oL, Blood 27:654-661 (1966); 
Black, CP. et al., Respim. Physiology 39:217.239 (1980)), fish (Zanjani. 
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E.D.. et al.. Science 33:513 (1969)) have erythropoietic hormones that fail to 
cross-react with mammalian crythroid cells, and vice-versa (Rosse. W.F.. 
et al.. Blood 22xfAni (1963); Pinder, A. et ai, J. Exp. Biol. 705:205-213 
(1983); Rosse, W. etal., B/oorf 27:654-661 (1966)). 

TTius for. the Epo genes of man (Jacobs. K.etal., Nature 575:806-10 

(1985) ; Lin. F.-K. et al.. Proc. Natl. Acad. Sd. USA 82:7580-7584 (1985)), 
a monkey {Macacafasdcularis) (Un. F.-K. etal.. Gene -«:201-209 (1986)) : 
and a rodent, the mouse (McDonald. J.D. et al., Mol. & Cell Biol. 6:842-848 

(1986) : Shoemaker. CB.etal, Mol. & Cell Biol. 6:849-858 (1986)) have 
been cloned, sequenced, and expressed. In view of the marked cross, 
reactivity between mammalian Epos, it is not surprising that there is a high 
degree of sequence homology in the coding region of the mature secreted 
proteins. In keeping with their close phylogenetic relationship, human and 
nwnkey Epos arc 94% and 919J identical in nucleotide and amino acid 

15 sequence respectively. Incontrast, human and mouse Epos are 76« identical 
in nucleotide sequence and 80% identical in amino acid sequence. 

Direct information on the three-dimensional sirocture of Epo is not yet 
available. Insights into structure-function retationships of Epo can be gained 
from the analyses of a more complete set of animal sequences. Such 

20 information could be useful for sequence-based computer modeling of three, 
dimensional structure. Moreover, a larger data base would pennit the 
identification of highly conserved domains that are likely » be crucial to 
folding and/or biological function. Finally, comparative amino add and 
nucleotide sequence information on Epo provides additional data foe 

25 investigating phylogenetic relationships. 

In this Example, the use of PGR based techniques for obtaining the full 
coding sequences of Epos of Rhesus monkey, rat. sheep, pig and cat, as well 
as partial sequences from four otiier species is demonstrated. 
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Materials and Methods 

Animal Samples. Human hepatoma Hep3B cell line and multiple 
kidney-derived cell lines from hamster (BHK). sheep (MDOK). pig (LLC- 
PKl). dog (MDCK), cat (CRF-K). lion (PALl-K) and spotted dolphin (SPl-K) 

5 wet« obtained thiough the American Type Culture Collection (American Type 
Culture Collection. 12301 Parklawn Drive. RockvUle. MD 20852).- 
Monolayer cells were grown in 100 x 20 mm tissue culture dtehes using 
recommended media and maintained in a humidified 5% COj/95% air 
incubator at 37'C. In some experiments, cells were made hypoxic by 

10 overnight incubation in 1 « Oj, 5% CO,. 94% Nj at 

Poly A+ RNA prepared from Rhesus monkey kidney was purchased 
from Clontech (Palo Alto. CA). Kidneys from anemic cat and horse were 
obtained from veterinarians at Tufts School of Veterinary Medicine and the 
University of Nebraska, respectively. Anemia was induced by three 

15 consecutive daily intraperitoneal injections of phenylhydrarine (60 mg/l^ body 
weight) into male Sprague-Dawley rats (SD-sttain) and by repeated bleediitg 
of the sheep and die pig; Kidneys were aseptically removed after induction 
and stored immediatdy in liquid lutrogen. 

A 4 kb genomic human Epo clone (g Epo4) was provided by Genetics 

20 Institute (Boston, MA). 

DNA and RNA Pnparations. DNAs ftom cell lines or homogenized 
kidneys were extracted using pancreatic RNAase/SDS/proteinase K following 
a procedure modified from Blin and Stafford (Blin, N. et at., NuddcAdds 

Jte. 5:2303^2308(1976)). 
25 Kktaeys were homogenized in 4 Mguanidineisotiiiocyanate (10 ml/g 

of froien organ), containing 0.1 M beta-mercaptoetiianol and 10% N-lauryl- 

saroodne. Total RNAs were isolated by centrifiigation over 5.7 M CsCl 

- (ChiiBwin. J.M. etal., moduamstry i&5294 (1979)). After ethanol 

precipitation, the samples were resuspended in diefliyl pyrocartwnaie-trrated 

30 water and stored at -70°C. Confluent monolayer cells were washed twice with 



wo 94/24160 



PCTAJS94y04361 



-32 



. sterile phosphate buffen^ saline and direcUy lyscd in the 4 M guanidine 
isothiocyanate solution. Total RNAs were isolated as described above for the 
kidneys; 

RNA samples were first converted into single strand cDNA. 2 to 4 Mg 
5 of total RNA from kidney or 500 ng to 1 ME of total RNA from cultured cells 
were denatured at 68-C in the presence of 2 of oligo dT<.,, Reverse 
transcription was carried out in a 20 m1 final volume, containing 50 mM Tns- • 
HCl pH 8.3. 6 mM MgCl,. 40 mM KCl. 10 mM DTT. 500 |.M dNTPs. 20 
u RNAsin (Promega. Madison! Wl) and 20 u. of avian myeloblastosis virus 
10 reversetranscriptaseCAMV-SuperRT.MolecularGeneticResources. Tampa. 
FL). The reaction was allowed to proceed at 37-C for 20 minutes, then 
at 42'C for one hour. Inactivation of the enzyme was performed at lOO'C 
for 10 minutes. cDNA samples were stored at -70'C. 

Polymerase Chain Reaction. 20 to 100 ng of genomic-DNA or 0.5 
15 u,i;tl of the RT-reacUon were used as templates for amplification. 

Standard PCR reactions were performed in a 100 ,i\ voltiiiie 
containing 50 mM Tris-HCl pH 8.3. 50 mM KCl. 1.5 mM MgCl,. 0.01 % w/v 
gelatin. 200 mM dNTPs, 30 pM of each sense (5') and antisense (3') pnmers 
.and 0 5 units of Taq DN A polymerase (Peridn Elmer Cctus. Emeryville. CA). 
20 After an initial denamration step for 5 minutes at 95-C. routinely 35 cycles 
of PCR were performed on the DN A Thermal Cycler (Peridn Etacr Cetns). 
mdenaturation step of each cyde was carried out at 94-C for imin. For 
. each sample and/or the primer pair used, amualing temperatures were 
optimized (from 4rC to 60«Q for a 2-minute reaction. A 3-minute 
25 elongation siep at 72'C ended each cycle. 

Amplification products were analyzed on 1 to 3% agarose gel-TAE. 
Nu Sieve or Seaplaque agaroses (FMC. Rockland. ME) were used for 
preparative purposes. Specific PCR products were recovered ftpm the gel. by 
the use of the Gene Clean n kit (Bio 101, La Jolla, CA). by Spin X 
30 centrifiigation (Costar. Cambridge. MA) or by standard phenoVcWoroform 
Qctiacdons of melted gel slices. 
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Sttbcbning and Sequencing. Small (250-330 bp) amplified genomic 
fragments were direcdy subjected to automated sequencing, using tlic PGR 
primers as DNA sequencing primers. 

PCR-geneiated cDNA products were blunt-ended using Klenow 
5 fragment and 5' phospborylated by T4 DNA kinase (Boehringcr Mamiheira. 
Indianapolis, IN). They were then cloned into the Smal site of the phagcmid, 
pBluescript II SK (Stratagenc. U Jolla. CA). The double-stranded plasmid 
was sequenced using the SK and KS sequencing primers. 

In all cases. Sequencing reactions were carried out on an Applied 
10 Biosystems 373A Automated DNA Sequencer utilizing ABl's DyeDeoxy 
Terminator Sequencer kit (Foster City, CA) and thermal cycling with Taq 
DNA polymerase (Promega, Madison. WI) as previously described (Tracy. 
T.E. etaL, Biotechniques 7/:68.74 (1991)). To avoid errors, for each 
species, saraplesderived from differentamplification/cloningcxperimentswere 

15 prepared and both strands were sequenced. 

MammaliM Expressbn; BbacdvUy. Vectors containing fiiH-length 

mammalian Epo coding sequences were subcloned into pSG5 plasmid 

(Pharmacia. Piscataway. NJ) and were transiendy expressed in Cos? cells. 

72-hour supematants wet* tested for their ability to sustain cellular 
20 proliferalion of the Epo^iependent HCD57 cell line (Sawyer. S.T. et d., 

Bltibd n (suppl. 1): 132a (1988)). 

Results and Discussbtn . 

Because of the known biological cross-reactivity between various 
mammalian Epos and the presumption of strong homology in the coding 
25 sequences, a logical doning strategy would be to screen cDNA libraries with 
modeiBie to low stringency. However, Epo mRNA is expressed at barely 
detectable levels in 0 tissues except in hypoxic kidney and liver (CSoldberg. 
M.A. et al.. Proc, Natl. Acad. Sd. USA &4: 7972-7976 (1987); Fandrey. J. 
aal.. Blood 81: 617-623 (1993)). Therefore it would be necessary to 
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generate libraries for each of the species of interest. Our primary research 
interest is in structure-function relationships of Epo and therefore we require 
full coding sequences of *e mature Epo protein, rather than full cDNA 
sequences. Accordingly, we elected to employ a PCR cloning strategy 
predicated on known strong homologies in Epo cDNA sequences that flank the 
region encoding the mature protein. n«re was sufficiem conservation of 
sequence around the NCOl site in the 3' untranslated region (UTR) that a 
single primer could be used to amplify full length mature Epo coding sequence 
in rat. sheep, pig and cat as well as dog and rabbit (data not shown). 
However, we were unsuccessful in finding a universal 5' primer and therefore 
had to generate species specific sequences from amplifications of genomic 
DNA. This two stage PCR strategy proved to be satisfactory in generaong 
highly accurate sequence information with a minimum expenditure of time and 

resources. In all insumces. there was fiill agreemem between sequences of 
15 complementary strands. 

PnpantionofMammaUemBpocVNAs 

Primer Design for PCR AmpUfication of Epo. Thescjquencesof Epo 
genes from three species have been already publislujd: two pri^^^ 
(Jacobs. K. er a/.. Nature 3/5:806-810 (1985); Un, F.-K. «f cL, Proc. Nad. 
20 Acad Sd. USA 52:7580-7584 (1985)) and Cynomolgus monkey (Macaca 
fasdcularis) (Un. F.-K. et al.. Gene 44:20U2O9 (1986)) and a lodeitt. the 
r^iMus musadus) (McDonald. J.D. ei al., Mol. & Cell Biol. 6:842-848 
(1986); Shoemaker. C.B. er al., Mol. d ailBiol. 6:849-858 (1986)). THere 
is substantial homology at the nucleotide level not only in the codtng 
25 sequences, but also in portions of introns and in 5' and 3' untranslated 
sequences of cxons 1 and 5. respectively. A number of primers, 
corresponding to conserved micleotide sequences between mouse and-human. 
were synthesized and tested for their ability to produce genomic-PCR 
ftag^ ftom a wide variety of mammalian cell Unes. A 4 kb human 
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genomic clone and genomic DNA extracted from the human hepatoma cell line 
Hep3B were used as control templates for the PCR amplifications. 

In particular, two pairs of primers. IV1/EX2R and EX5/NC01 
(Figure 1), directed the amplifications of fragments of 330 and 250 bp 

5 (respectively) from genomic DNA of all human and mammalian cell lines that 
we. investigated. IVl is a 25-mcr oligonucleotide localized in intron 1, 
216/217 bases upstream from tite start of exon 2. Human and mouse are 
identical except tiat the human sequence contains an extra G nucleotide at 
position 996. IVl primer corresponding to the mouse sequence was 

10 synthesized. The 23-mer EX2R ends 28 nucleotides upstream from the 3' end 
of exon .2. EX5. is a 22-mer, beginning 34 bases downstream from the 5' end 
of exon 5. NCOl represents a 20 bp. 100% conserved DNA fragment, 
starting 1 12/1 13 bases downstream from the TGA stop codon and contains the 
umque NCOl site present in tiie human and mouse gene. 

15 An ATG primer was also syndjeslzed, corresponding to a 20 

oligonucleotide stretch, extending bofli 5' and 3' from tiie initiator methionine 
codon. This sequence is also tptaUy conserved between primates and mouse. 
While combinations of exonic primers (ATG1EX2R, EX5/NC01 and 
ATG/NCOl) were able to direct amplification from cDNA prepared from 

20 RNA of a»e human Epo-producing cell liiJe Hep3B, no ampUfication was. 
obtained on cDNA prepared from uninduced or hypoxia-induoed BHK, 
MDOK. LLC-PKl. MDCK. CRF-K, PALl-K and SPl-K cdU (results not 
shown). Therefore tiiese renal cell lines apparenfly do not express Epo 
mRNA. eitiier constitutively or after hypoxic stress, at a level detectable even 

25 by RT-PCR. Furtiiermore, no Bpo protein was detectable by 
radioimmunoassay in tiie supematants of cells mMntmned overnight in 1 % 0,, 
5% COi. 94% Nj at 37X (Goldberg, M.A. et al., Proc. Natl Acad. Sd. 
DIM W:7972-7976 (1987)). 

The amplification of species specific IV1/EX2R and EX5/NC01 

30 genomic fiagments allowed us to design a strategy shown in Figure 2 for 
obaining cDNA clones containing Uie complete coding sequence of mature 
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Epo protein fiom various mammals. The IV1/EX2R primer pair resulted in 
agenomicfragment(-330bp)corresponding tothc3' third of die first intron 
and 80% of cxon 2. enaxling the COOH-end of the propeptide and the NH,- 
terminal amino acids of the mature protein. EX5/NC01 primers amplified a 

5 segment of exon 5. containing the COOH^nd of the Epo protein and a 3' 
untranslated sequence of about 140.bp downstream of the stop codon: DNA 
sequencing of these 5' and 3' fragments permitted, if necessary, the design of • 
a second generauon of species-specific primers (SP). localized outside the 
coding part of the mature protein fi-e.. in the sequences encoding the 

10 propeptide and the 3' untranslated region). 

AmpUficatbn of Piutiallntnn i/ Exon 2 Genomic Oones. WefiKt 

explored the efficacy of the IVl and EX2R primers for amplification, of 
genomic fragments from differem purified DNA templates. In all the 
reactions, fragments of predicted size were the main (and. when using the 
most stringem am«aling temperature, often the only) products detected by 
agarose gel electrophoresis. TTie (direct) sequences of tiies«5 PCR-generated 
fragments are presented in Figure 3. TTie nucleotide alignment includes 6 
different orders of mammals: one primate, human {Homo sapiens)', two 
aruodactyls, sheep (Oynsanes) and pig {Sinsscnfa)\ oneperissodactyl. horse 
{Equus caballus)', one cetacean, spotted dolphin (SteaeUa p\a&odott)\ three . 
carnivores, dog (CamsfamiUaris), cat {FeBs aim) and lion (Pmthem leo)\ 
two rodents, mouse (Afw musadus) and hamster {Cricem cricetus). Gat and 
lion exhibited an almost complete sequence identity, with only one T to G 
nucleotide substitution near tiie 5' end of tiie amplified portion of flic fiist 
25 intton. 

The ability of die intronic IVl to anneal to genomic DNA from various 
spcciesand,incombinationwid»EX2R.toamplifyPCRfragmentsofidentical 

size, demonstraied a high degree of conservation of sequence and position 
between mammals. This suggests tiat IVl sequence may be involved to some 
30 extent inihe icgulation of the Epo gene expression. TTie comparison of 
IV1/EX2R sequences also levealed two remarkably conserved intronic 
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sequences: ATCr/A)GAATGAA(G/C)GC (SEQ. ID NO: 1] (nucleotides 1046 
to 1058 in the human gene) and A(A/DGGTN(G/OGGG [SEQ. ID NO: 21 

(nucleotides 1085 to 1094). 

Amptificatbn of Partial Exon 5 Fragments from Genomic DNAs. 
5 PCR reactions were also performed applying the EX5/NC01 primer pair to 
various purified genomic DNAs. As previously observed for the amplification 
of IV1/EX2R. in all the tested samplw a unique main band of about 250 bp 
(as predicted for human and mouse genes) ivas detected on analytical agarose 

gel electrophoresis. Direct DN A sequencing of these fragmcms demonstrated 
10 that the bulk of the purified PCR product corresponded to the expected Epo 
exon 5 sequence. However, for some DNA samples, purified from horse 
kidney and several cell lines (BHK and LLC-PKl In particular), unknown 
sequences were present to various extents. These minor contaminating PCR 
products generated extraneous peaks in the sequence data, resulting in 
15 ambiguities at several positions. Nevertheless, computer^dited analyses of 
generated 3' non-coding sequences were sufficient to design, if necesaiy, 
specific primers witii greater than 95% accuiacy (data not shown). 

Cloning of Partial cDNAS Encoding the Complete Maturt Epo 
Protein. As we previously mentioned, we were unable to obtain any 
20 amplification wifl> cDN As prepared from lenal-derivedinainmaliancd^ 

Tlieiefore. we tested our battery of primers on leverse^ianscribed RNAs 
prepared from kidney of several mammaU. We have succesfiiUy aniplified 
^ cDNA from: one primate. Rhesus monkey (Wacaca iiwflflte), one 
carnivore, cat {FeUs cam), one rodent, rat (Mm norvegicus) and two 
25 artiodactyls. pig (5« scrqfa) and sheep {(Ms ones). The aligned nucleotide 
sequences arc prcsemed in Figure 3. They have been submitted to GenBank 
and have been assigned accession numbers. 

Rhesus monkey, lat and sheep cDNAs were amplified using tiie 
ATCWCOl primer-pair combination. Pig and cat sequences vete obtained 
by the use of a 5' 24-iner specific primer {SPl-5' 
TXXTIXTCCTATtTmKnXSCrcC 3') (SEQ. ID NO: 3]. IV1/EX2R 
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sequencing demonstrated that this exon 2 sequence was shared by the two 
species. In all PCR amplifications. NCOl was used as the reverse primer. 
Conservation of the NCOl sequence among mammals suggests a potential 
biological role of this nucleotide sequence (perhaps in modulating the stability 

5 of Epo mRNA) (Rondon. L J. etai, J. Biol, am, 266:16594-16598 
(1991)). Considerable homology is also found in the sequenced 3' 
untranslated segment of exon 5 (Figure 4). 

When compared with human Epo. the overall percentages of nucleotide 
identity are, lespectively: 93.5 for the Rhesus monkey. 86.7 for the cat, 85.7 

10 for die pig. 83 for the sheep and 75.5 for the rat. Rhesus monkey and 
Cynomolgus monkey (Un. F.-K. et al,. Gene 4*201-209 (1986)) were 99.6% 
idcndcal. Mouse (McDonald, J.D. et al:, Mol. A Cell Biol. 6:842-848 
(1986); Shoemaker. C.B. et al., Mol. & Cell Biol. 6:849-858 (1986)) and rat 
nucleotide sequences showed greater than 93 % identity. The two sequenced 

15 artiodaoyls, pig and sheq). showed 88% idendty.' 

Comparison of Mammalian Epo Frimaty Sequences 

PropQjlWej^ The predicted anuno add prt^cptidc sequences of s^^ 
mammais are presented in Rguie 5. In the primates, there was strong 
OHKervation of the sequences of deduced propeptides, with only two amino 

20 add (aa) substinidons at portions -11 in humans vs. Val in bodi 
monkeys) and -2 (Lea in humans vs. Pro in both monkeys). Ex|>resslon of 
Cynomolgus monk^ Epo gene in culuied mammalian cells resulted in die 
production of mature monkey Epo ftat was dongated at the N-terminus by 
three additional residues: Val-Pro-Gly (Un, F.-K. ef of.. Gene 44:2Q\-2W 

25 (1986)). As the Rhesus monkey has die same substitutions, an identical ate 



' Several PCR attempts on total RNA purified from die horse's Iddney 
were unsuccessful, even when iiang 5' and 3' ^ific equine primers ^P), 
presumably because analysis on 1% formaldehyde agarose gel demonstrated 
the degradation of the RNA. 
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of cleavage is likely. The Leu lo Pro amino acid replacement ai -2 is 
probably responsible for the differential activity of the signal peptidase 
observed between man and monkeys. The three rodents, mouse, rat, and 
hamster, were found to have identical propeptide amino acid sequences. 

Mature protein. A comparison of the three previously published and 
our five deduced amino acid sequences of mature Epo proteins is shown in 
Figure 6. Erythropoietin is highly conserved among mammals. The amino- 
acid alignment showed that more than 63% of the molecule is composed of 
invariant amino acids (106 residues) (Figure 6A). Most of the observed 
substitutions are conservative, involving residues with similar physical and 
chemical properties. The calculated percentages of sequence identity are 
shown in Table II below. 



IS 



20 



TABLE n 
Degree of conservatf on among liianunab 
of the mature ^ proteins 

The percentages of identity among the various sequences were 
determined from the amino acid alignment reported in Figure 6. 



Primates 


Man/Rhesus Monkey 


90.5 




Rhesus/Cynomolgus 


98.8 


Rodents 


man/mouse 


79.8 




man/rat 


82.1 




mouse/rat 


93.5 


Artiodactyls 


man/sheep 


81.0 




man/jpig 


81.6 




sheep/pig 


88.7 


Carnivores 


man/cat 


84.5 



% identity 



Hieie is 100% consnvadon of Cys7 at the N-terminal portion of the 
polypqitide md Cysl61 tear die C-tenninus. Hus disulfide bridge is essendal 
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to the formation of a stable and functional cytokine (Wang. F.F. etal., 
Endocrinoloiy 776:2286-2292 (1985)). In contrast, while Cys29 is invariant 
in all the species we examined. Cys33 is presem only in primates, sheep, pig 
and cat, but not in the two rodents where there is a proline. TTie lack of 
5 funcrionalimportanoeofthisshortdisulfideloopisunderscoredbymuteinsin 

which tyrosine substitutions at each or both of these sites have no effect on 
^'s biological activity (See Example II). 

Theie is also 100% conservation of all three asparagine residues 
responsible for N-Iinked glycosylation. Even though Epo. deprived of its 

10 carbohydiate either by enzymatic cleavage aakeuuchi. M. et ai, J. Biol. 
Chem. 265:12127-12130 (1990)) or by production in bacteria (Nahri, L.O. 
et al.. J. Biol. am. 266:23022-23026 (1991)) has fiill in vitro biological 
activity, survival of Uie hormone in tiie circulation depends upon N-Iinked 
glycosylation (Spivack. J.L. et al.. Blood W:90-99 (1986)). In contrast, the 

15 Q-Unked glycosylation site (Serl26 in hunun Epo) is not essential for 
biological function since it is missing in nMuse and lat 

As shown in Figure 6. tiioc is very high conservation of sequence in 
regions which, by algorifluns based on primary strucmre. are predicted to be 
alpha helices (See Example 11). The few difffcrenoes in sequence widun these 

20 helical regions are conservative replacements. Indeed, some sites withia Ifaese 
predicted alpha helices are conserved in corresponding helices of oiiwr 
cytokines (Manavahin. P. et al., J. Prof. Oiem. iJ:321-331 (1992)). In 
contrast, regions predicted to be inter-helical loops are less well conserved, 
particularly die 14 residue stretch bewecn residues 1 16 and 130 of human 

25 Epo. In Example II a mutein with a deletion of leadues III througih 119 is 
shown to have normal stability and biological activity, whereas a deletion of 
residues 122 tiitough 126 feiU to produce a detectable protein, probably owing 
to markedly impdred stability. 

Similar overall 4 o-helical bundle structure exists or is predicted for 

30 several otiier cytokines/hormones, such as growUi hormone (GH), granulocyte- 
macrophage colony-sdmulatingfactor (GM-CSF). IL-3. IL4 and E^5 (Bazan. 
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J.F. et d.. Immunol. Today 11: 350-354 (1990)). Like Epo. GH exhibits a 
high degree of primary sequence conservation and biological crossreactivity 
between mammals. The mouse, rat, pig and sheep proteins show about 80% 
amino acid identity with human GH. In other cytolcines. however, there is 

5 more sequence diversity among species. For example, the human and murine 
forms of IL-5. GM-CSF and IL-3 show respectively only 69. 67 and 2695 
amino acid homology. Furthermore, unlike Epo or GH. those proteins lack 
inter-^>ecies biological cross-reactiviqr. 

Expmsion of the Mammalian Epo Homoaes. Mammalian Epo 

10 cDNAs were subcloned into the pSGSpIasmid and transiently expressed in the 
monkey Cos? cell line. Supcmaonts of the transfected cells were able to 
sustain the cellular proliferation of the murine ^xnlqwndent HCD57 cell line, 
demonstrating the biological cross-reactivity between species. As flie human 
^ antiseia used in our radioimmunoassay bind to ^ from other species 

15 with variable affinity, die amount of produced protein was difficuU to 
determine accurately. Experiments are in progress to further characterize the 
relative affinities of the various mammalian eiythiopoictins toward murine and 
human Bpo recq>t(»s. 

Phylogenetie Andyses of Sequences. Nucleotide sequences 

20 encoding tiiefiUllengtii mature Epo protein were analyzed by the maxinnm 
parsimony metiiod (Crclusniak. J. et d.,MeA. Emymol Ji83:fi01-615 (1990); 
Stanhope. M.J. et d., Mol Phybg. £w/. 7:148-160 (1992)). An "all trees- 
set of computer programs determined die paramoiQr lengtii of eadi of the 945 
unrooted trees formed by tiie seven cDNA sequences (die terminal taxa) 

25 (Figure 4) and, on ordering tiiose 945 trees according to increasing lengfli. 
Identified tiie minimum number of extra nucleotide substiditions needed to 
break up each group in tiie maximum parsimony (lowest lengdi) tree. As 
shown in Figure 7 and 7A, tiie mouse and rat are strongly grouped, as are flie 
human and monkey. The pig and sheep are more weakly grouped, in accord 

30 with accepted phylogenetie views, which place die divergence of diese animals 
about55-60 million years ago. As expected witii die parsunony reconstruction 
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on only the codon sequences, silent base substitutions occurred much more 
readily than amino add replacements (Figure 7B). 

The Epo intronl^on 2 sequences shown in Figure 3 were also 

analyzed by the maximum parsimony method. Ftgutt 8 shows strength of 
5 grouping results based on all 135.135 unrooted trees formed by nine terminal 
taxa (in this case, cat and lion sequences, which are nearly identical, were 
treated as a single taxon). and Figure 8A shows a pbylogenetic tree that- 
combinesmaximumparsimonywithestablishedphylogeneticevidence. Asthe 

n,ost parsimonious tree (Figure 7) for these relaUvely short sequences felled- 
10 to join dog to die feloid (cat and lion) branch (reflecting the fact that the 
ancestral split between canoids and feloids traces back to the «irly camivotes 
at about 60 million years ago), the pbylogenetic tree shown in Figure 7A is a 
near most parsimonious uee in which dog joins the feloids. The rodents 
(n^ouse and hamster) are strongly grouped, as are the feloids (cat and hon). 
15 and at more moderate strength the cetacean (dolphin) is grouped with the 
artiodactyls (sheep and pig). TT^re arc other molecular data as weU as some 
paleontological data that depict cetaceans originating from eariy artiodactyls 
(C^elusniak. J. et al. Meth. EnzynuA. J«:«)W15 (1990); Czelusniak, J. 
et al.. In "Currem Mammology". Volume 2. 3rd ed.. H. H. Ganoways. 
20 pp 545-572. Plenum Publ. Corp. (1990); Irwin. D.M. et d.. L. Mol 
EwL 52:128-144 (1991): Gingreich, P.D. et aL, Sdence2^.\5A-l51 (1990); 
Novacek. M.J.. Nature 356:121-125 (1992)). 
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EXAMPLE II 

Structure Function Relationships of Erythropoietin: 
MuteinsthtUTestaModelof Tertiary Structure 

Summary 

5 On the basis of its primary sequence and the location of its disulfide- 

bonds/ we prqK)se a structural model of the erythropoietic honnone 
erythropoietin (Epo) which predicts a 4 a-helical bundle motif, in common 
wiA other cytokines. In order to test this model, site directed mutants were 
prepared by high level transient expression in Cos? cells and analyzed by a 

10 radio-immune assay and by bioassays utilizing mouse and human Epo 
dqwndent cell lines. Deletions of 5 to 8 residues w&in predicted a helices 
resulted in tiie fiulurc of export of the mutant protein from the cell. In 
contrast, deletions at the N-terminus (Al-6), the C-terminiis (A163*166)i or in 
predicted intcrhelical loops (AB: A32.36, A53.57; BC: A78-82; CD: 

15 Alll-119) resulted in the expon of immunologically delectable Epo mutdns 
that were biologically active. The mutein A4»-52 could be readily detected 
. by RIA but had markedly decreased biolq^ activity. However, 
replacement of each of tiiese deleted residues by serine resulted in 1^ 
mutpins with fidl biological activi^. Replacement of Cys29 and Cys33 bjf 

20 tyroaw residues also resulted in the eKpon of fully active Bpo. Therefore dtis 
sn^l disulfide loop is not critical to Epo's stabiliQr or function. The 
properties of die muteins tfiat we tested are consistent witii our proposed 
model of tertiary strucnire. 
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Introduction 



Humoral regulation of red blood cell production was first prt)poscd at 
the beginning of tiiis century (Camot. P.. etal.. CR. Seances Acad. 
Sd. i«:432-35 (1906)). Convincing physiologic experiments documenting 
the existence of erythropoietin (Epo) (Krumdieck. N.. PnK. Soc. Exp. Biol. 
Med54.U-n (1943); Reissmami. K.R.. B/a«/5:372-380 (1950); Erslev, A.. " 
Blood 5:349-357 (1953); Jacobson. L.O.. et at.. Nature 779:633-634 (1957)) 
were followed by its purification (Miyake. T.. etal.. J. Biol. 
Oiem. 252:5558-5564 (1977)) and partial structural characterization (Lai. P.- 
H.. et al.. J. Biol. Chem. 267:3116-3121 (1986)). The molecular cloning of 
this biologically and clinically important cytokine (Jacobs. K., etal.. 
Na/urt 575:806-810 (1985); Un. F.-K., etal.. Proc. Nat. Acad. Sd. 
USA 52:7580-7585 (1985)) has led to further understanding of its properties 
(Sasaki. H., etal.. J. Biol. Chem. 262:12059-12(^76 (1987); Davis. J.M.. 
15 er a/., B/ocfcon/j/O' 26:2633-2638 (1987)). 

The binding of Epo to Its cognate receptor (D* Andrea, A.D.. et al.. 
CeU 57:277-285 (1989)) on erydiroid progenitors in the bone marrow results 
in salvagmg these cells from apoptosis (Koury. M.J., et al.. 
Sdflice 248:378-381 (1990)). allowing them to proliferate and differentiate 
20 into drculafing erytiuocytes. Tbt Epo receptor is a member of an ever 
enlarging Wy of (Ttottne receptors (Bazaii. F.. ftVK^ 
Gwimiw. iW:788-796 (1989)). In like mam»er. Epo shares we^ 
homology with other members of a fimiily of cytokirKS which also include 
growth honnone. prolactin. IL.2. IW. IM. IL-S, 11^. IL.7. GCSF. OM- 
25 CSF, M-CSF, oncostatin M, leukemia inhlbltoiy fecior and ciliary 
- neuri»opbic fkctor (Pany. D.A.D... et al.. J. Mol. Recognition 7:107-110 
(1988); Baan. Immunology Today 77:350-354 (1990); Manivalan. P.. 
etal.. J. Protein Oiem. 77:321-331 (1992)). The genes encoding these 
proteins have similar numbers of exonsas weU asa dear relationship between 
imron-cxoD boundaries aiid predicted «-helical stnictaie. These similarities 



30 
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. haveledtothcprediaionthatAisfamUyofcytokincsshareacomm^ 

of folding into a compact globular structu«^ consisting of four amphipath.c 
a-hclical bundles. Such dveorttical models of Aes«ucn«es of human growth 
hormone (Cohen. F.E.. eial„ Protem: Struct. Func. Genet. 2:162-166 

5 (1987)) and llA "(Curtis. B.M., etal.. Proteins: Strua. Func. 
Genet. i/:lll-119 (1991)) have been in remarkably good agreement w.tii 
subsequentsti.cturesestablishedbyX-raydiffraction(HGH)(Abdel-Meguid. ■ 

S.S.. et al.. Proc. Natl. Acad. Sci. USAU.Mi^^l (1987): deVos. A.M., 
etal Science 255:306-312 (1992)) or by multidimensional NMR (MrA) 
10 (Redf>eld.C...rd.,B/od.em,-5ro'i0:11029-11035(1991):Powe«^^^^^ 

Science 256:1673-1677 (1992)). Moreover, the crystal structures of GM-CSF 
(Diederichs. K.. et al.. Science 254:1779-1782 (1991)) and monomeric M- 
CSF (Pandit, J., era/.. Sdenc. 258: 1358-1362 (1992)) are also in reasom^^^ 

agreement with dieir predicted structures. 
15 Thus for. the strucnire of Epo has not been analyzed by eitiier X-iay 

diffraction or by NMR. In order to begin to gain an understanding of 
structure-fimction relationships, we have taken a tiiree-pronged approach: 

a) Sequence determination of Epo from mammals of different 
orders in order to «tabl|sh regions of homology. This woric is described in 

20 Example I. 

b) Construction of a model of the tiiree-dimensional structure of 

Epo! followed by die design and preparation of muteins flat test tins model, 
expcrimems are presented in thb Example. 

c) Design and testing of muteins that provide information on 
25 receptor binding domain(s). Hiis work b presented in Example ffl. 
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Materials and Methods 



Computer-based Modeling of Structure 

Prediction of Secondary Structure. Epo sequences from human, 
monkey, mouse, lat. sheep, pig. and cat were aligned (See Example I), and 
5 examined using a hierarchical approach to secondary stnicmrc predicuon that 
assumes that these proteins are members of the «/a folding class (Uvitt. M. 
a d.. N<u««r 267:552-558 (1976)). First, the pattern-based method of Cohen. 
F.E. et d.. Biochendstry 25:266-275 (1986) for turn prediction was used to 
delimitsequenceblockslikelytocontainsecondary structure. Predictions using 

10 the methodsof Gamier. J. et d,. J. Mol. Biol i2(h97-120 (1978) and Chou. 
?y etd.. Am. Rev. Biochem. 47:251-276 (1978) suggested a-hehcal 
regions within these blocks. Rnally. heUcal wheel projections were used to 
examine and then limit helix length based onpieservingamphipathic character 

as codified In the work of Presnell. S.R. et d.. Biochendstry 5i:983-993 
15 (1992). The locations of glycosyhition sites were also used to suggest helix 
boundaries. 

TertiaryStrueturePredictipn. Eariier investigations have revealed the 
general principles of helix-to-helix packing in globular proteins (Richmond. 
TJ. et d.. J. MoL Bid. iJW37-55 (1978)). Exploring these principles. 

20 Co^ F.E. et d,. J. Md. Bid. M2:275-288 (1979) developed a mellMd for 
tije generation of three-dimensional protein structures fkom the secondary 
strucmre assignment. These medwds have been applied to myosin, tobacco 
mosaicviruscoatprotein. growtii hormone, a- and p-interfeion.II^2.andIL4 
(Cohen. F.E„ er d., J. MoL Biol. JJ2:275-288 (1979); Cohen. F.E.. et d., 

25 Sdence 234:349-352 (1986); Cohen. F.E.. et d.. Proteins: Struc. Func. 
(^.2:162-166 (1987); Sternberg. M.J.E.. etd.. Int. J. Biologicd 
Maamolecdes 4:137-144 (1982)). 
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The algoriJhm for tertiary structure generation is divided into four 
computations. The prtjgram aapatch identifies clusters of hydrophobic residues 
within the putative helices that could mediate helix-helix interactions 
(Richmond. T.J.. et aL, J. MoL Biol 7i9:537.55 (1978)). Aafold generates 

5 all possible helix pairings according to the location and geometric preferences 
oftheinteractionsites. AabuUd generates the thiee-dimcnsional models oif all 
possible strucwres from the list of helix pairings (from aafold) and subject to 
stearic restrictions and geometric consiraims on chain folding. In the final 
step, aavedor applies the user defined distance constraints (e.g., disulfide 

10 bridges) to *e structures generated. At this stage, coordinates have been 
specified only for residues in the core o-helices. For residues in sequentially 
distinct loops, lower bounds on the inter-residue distancescan be inferred from 
the relevant helix terminus. 



Pnparttdon ofEpo Muteins 

15 anamcthnoftheMutdgenk/MammaUanExpnssbttPlasmid. A 

M13 plasmid. containing a 1.4 Kb EcoRI-EcoRI human Epo cDNA Insert 
OJlEpo FL12) was a gift from Genetics Institute (Cambridge, MA) (Jacobs. 
K.,.rt fl/.. Nature 373:806-810 (1985)). A 943 bp EcoRI-Bgffl fragmem. 
corresponding to the complete coding sequence of the wild type human 

20 cijihiopoletin. including untranslated regions 216 bp upstream and 183 bp 

downstream, was inserted into the mammalian expression plasmid pSG5 

(Stratagene) (Sambiook. J., a al.. In "Molecular Cloning: A Laboratory 

Mamal", 2nd ed.. Cold Spring Hartor Laboratory. Cold Spring Harbor. NY. 

pp. 1.38-1.41 (1989)) and named pSG5-Epo/WT. 
25 site-DlrededMutageneAwascarriedoutaccordingtotheprotocol 

described by Kunkd, TJL, et aL, Meth. EiuymoL ISd-JSJ-m (1987). 
Single-soanded DNA was rescued from the pSGS-EpoWT phagemid grown 
overnight in £. coB CI236, in 2XVT media containing M13K07 helper phage 
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(In Vitrogen) and 70 /tg/ml kanamacyn (Sigma). The resulting uiacil- 
containing ssDNA was used as a template for mutagenesis. Oligonucleotides 
(24 to 46 mer) were synthesized with dieir 5' and 3' ends complementary to 
the target wild type Epo sequence. A large variety of mutations (base 
substitutions, deletions and insertions) were created at tiie centers of tiie 
mutagenic primer sequences. Annealingoftiiephosphorylated primers (10:1 
oligonudeotide/DNA template molecular ratio) was performed in 10 /il of a 
20 mM Tris-HCl pH 7.4. 2 mM MgQ,. 50 mM NaCI solution. The reactions 
weie incubated at 80'C for 5 minutes, tiien allowed to cool slowly to room 
temperature over a one hour period. The DNA polymeriiation was initiated 
by the addition as a mbt 6f 1 pi of lOX symhesis buffer (100 mM Tris-HCL 
pH 7.4. 50 mM MgCl,. 10 mM ATP. 5 mM each dNTPs, 20 mM DTD, 0.5 
,tl (8 ututs) of T4 DNA ligase and 1 ,il (1 unit) of T4 DNA polymerase 
(Boehringcr Mannheim). After2hoursat37«'C. 80,ilof IX TEwasadded. 
15 5 |d of die diluted reaction mix was used to transform competent E. coli 
NM522 (ung*. dttt*). 

Since a 40^80% mutation yield is normally obtained, 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced with T^leaza- 
dCTP and Sequenase (U.S. Biochemical) (Sanger, F., etal., Proc. Natl. 
20 Acad. Sd. USA 7*5463-5467 (1977)). As a rule, the entire coding sequences 
of the Epo mutants were exainined for the present* of uiiwanted mutation ^ 

sequendng dr lestriction oizyme nuking. 

PnductionofWldI)fpeEpoaad^Mtttein8inMammiaianCdlk 

Cos! cells grown to approx. 70% confluency were transfiscttd wiiJi 10 jtg 
25 recombinant plasmid DNA per 10 cm dish using tbe calcium phosphate 
precipitation protocol (Kingston. R.E., etal.. In "Current Protocols iii 
Molecular Biology". Green Publishing Associates & Wiley Interscience, NY, 
pp. 9.1.1-9.1.9 (1991))- As a control of transfection efficiency, in several 
experiments 2 ng of pCHl 10 plasmid (Pharmacia) was co-tnmsfected and p- 
30 galart»«M»«^ actiw^ measured in tite qrtoplasmic extracts. 
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RNA Blot-Hybridization Analysis. Total RNAs were prepared from 
cultuied Cos? cells (Chirgwin. J.M.. etal., Biochenustry 78:5294-5299 
(1979)) and 2 /ig samples electrophoresed on a 1.1% agarose gel containing 
2.2 M formaldehyde. Transfer to GeneScrcen Plus filters (New England 
5 Nuclear) and hybridization with a «P-Iabeled wi Epo probe were carried out 
as previously described (Faquin. W.C.. et d.. Blood 79:1987-1994 (1992)). 

QuantUatiott of Timsiently-Expnssed Recombinant Epos. The 
amount of secreted protein in the supematants of transfectcd Cds7 was 
determined by a radioimmune assay (RIA). TTie RIA was performed using a 
10 high titer rabbit polyclonal antiserum raised against the human wild type Epo 
and produ<^ in our laboratory. '"I-bbeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin, W.C.. et al.. Exp. Hmatol. 21: 420426 (1993)). 

Immunoprecipitation of "S-labeUd Epo Proteins. ThtuM^zfttT 

15 transfection. the Cos? monolayers were washed extensively with IX PBS and 
the cells incubated for 20 minutes at ZTC in 2 ml of Mer/Cysr Miniiiium 
Essential Media Eagle modified. In each culttiie dish, 100 fil of TRAN35S^ 
LABEL (p*S]-cysteine and ("5]-methionine. ~ 10 mCi/ml, ICN Biochemical) 
was then added. After two hours, the conditioned media were harvested and 

20 celluhr extracts were prepared by lysis in radioimmune precipitation buffer 
(50 mM Tris-HCI pH 8.0. 150 mM NaQ. 0.02% (w/v) sodium aride. 0.1% 
(w/v)SDS,0.5% (w/v) sodium deoxycholate, 1% (w/v) Triton X-100, 1 mM 
pheiiylmethancsulfonyl fluoride and 1 ftg/ml aprotinin). Samples were 
piedeaied widi rabbit preimmune serum/protein A-Sq)haiose CL-4B 

25 (Phannacia) for two hours. Immunoprecipitations were performed overnight 
vMi our polyclonal antibody ^fic for human recombinant wild type Epo 
and immunoadsoibcd with protein A-Sepharose CL4B. Immunoprecipitaies 
were ran on 15% SDS^wlyacrylamide gels (Lacmmli. U.K.. Nature 
227:680-685 (1970)) and analyzed by autotaUogr^hy after treatment with 

30 Enhance (New &)gland Nuclear). 
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Bbassays. The dose-dependent proliferation activities of wild type Epo 
and Epo muteins were assayed in vitro using three different target cells: 
murine spleen cells, following a modification of the method of Krystal 
(Krysial. G.^Exp. HematoL 77:649-60(1983); Goldberg, M.A.. et al., Proc. 
5 Natl. Acad. Sd. USA W:7972-7976 (1987)); Epo rc^nsive murine 
erythroleukemia cell line, developed by Hankins (Hanlcins, W.D., et aL, 
Blood 70: 173a (1987)); and human Epo-dependent ITT-l/Epo cell line, derived • 
from the bone marrow of a patient widi acute megakaryoblastic leukemia 
(Komatsu, N., et al.. Cancer Res. J7:341t348 (1991)). After 22 to 72 hours 
10 of incubation widi increasing amounts of recombinant protdns, cellular giowih 
was determined by pH]-thymidine (New England Nuclear) uptake or by the 
colorimetric MTT assay (Sigma) (Yoshimura, A., et aL, Proc. Natl. Acad. 
Sd. USA 57:41394143 (1990)), 

Bacterial Expression. The wild type Epo target, correspond!!^ to the 
IS nucleotide sequence coding for the mature protein, was PCR-amplified using 
appropriate primers. In the sense primer an Ndel site (CATATG) was placed 
inunediately 5' to the Alal oodon of the mature protdn. In the antisense 
primer a Bglll site was placed 3' to the TGA stop opdon. After enzymatic 
digestion, the 516 bp PCR fragment was inserted in an Ndel/BamHI-cut 
20 pETl^ plasmid (Novagen), which has a T7 promoter followed immediately 
by the lac operator. IPTG induction of transformed E. coll BL2 1 (DE3) (T7 
RNA polymerase^. Ion', otnpT) resulted in high levels of expression of a 
fusion protein with a ten hisddine stretch at the amino terminus. The oligo- 
His tag allowed the binding of the produced (HiSu,)*Epo on a nickel affinity 
2S resin and its elution by increasing imidazole conoentradons in presence of 
phenylmethylsulfonyl fluoride (Sigma). Most of the produced protein formed 
insoluble aggregates and was solubilized and afHnity-purified under denaturing 
conditions in 6 M guanidine-HCL Oxidative refolding was performed by 
overnight dialysis at 4^C against SO mM Tris-HCI pH 8.0, 40 /iM CUSO4 and 
30 2% (wt/vol) Sarkosyl. Soluble protein was fiinher dialyzed against 20 mM 
Tris-HQ pH 8.0, 100 mM NaC3, 2 mM CaCI, and subjected to factor Xa 



wo 94/24160 



PCT/US94/04361 



-51 - 

(New England Biolabs) cleavage to remove the NHj-polyHis sequence. 
Monitoring of the fusion protein following induction and during the various 
steps of purification was done by electrophoresis on a IS % polyacrylamide- 
SDS gel, stained with Coomassie Brilliant Blue. Aitematively. the His-Epo 
5 fusion protein was detected on Western Blot (Gershoni, J.M., et ai, Arud. 
Biochem. 131:IA5 (1983)), using a 1/2000 dilution of our wt native Epo 
polyclonal antibody and a second biotinylated rabbit-specific antibody which * 
is detected with a streptavidin-alkaline phosphatase conjugate (Amersham). 
In Viiro Transcription/Translation. Sense and antisense primers, 

10 creating new Bglll sites respectively S' and 3' of the initiator and stop codons, 
were used in a polymerase chain reaction on pSGS-Epo/WT template. After 
Bglll cleavage, the 594 bp PCR-fragmenl was subcloned into p SP64T (Krieg, 
P.A„ et al., NucL Acid. Res, i2:7057-7070 (1984)). This SP6 containing 
vector provides 5' and 3' flanking regions from Xenopus ^-globin mRNA, 

IS which allow efficiem in vitro transcription/oimsladon. Previous experiments 
showed poor yields of in vitro translated protein, when using the GC-ricb 
natural S' 1^ untranslated region (UTR). One step in vitro transcription/ 
translation was carried out by incubation of 1 ;ig of circular p64T-Epo in a SO 
/il reaction volume of SP6-TnT coupled rabbit reticulocyte lysate sy^m 

20 (Prpmega), in presence of (^5]-cysteine (1200 Ci/mM, New England 
Nuclear). In some cases, canine pancreatic microsomal membranes were 
added to the reaction mix. A purified GST-human ^ receptor extracellular 
domain fusion protdn (EREx) was a gift from Harris and J. Winkelman, 
and the binding of the ^-labeled translation products onto EREx-glutatiiione 

2S agarose beads were performed as described (Harris, K.W., ei al., /. BioL 
Cton. 267:15205-15209 (1992)). 
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Results 

Construction of a Model of the Thne-Dimendonal Structure of 
Erythropoietin 

From an analysis of the putative. Epo helix sequences, aapatch 

5 identified eight possible helix-helix interaction sites. In principle, these sites 
could be used to generate 1.6 x10* strucnires. Of these, only 706 maintained 
the connectivity of the chain and were sterically sensible. These structures 
resembled four helix bundles, an increasingly common motif in protein 
structure (Presnell, S.R., et al. Proc. Natl. Acad. Sd. 5(5:6592-6596 (1989)). 

10 The structures that were not compatible with the native disulfide bridge 
between Cys, and Cys,„ were eliminated. This reduced the total number of 
structures from 706 to 184 (total computer time approximately 1 hr. on a 
Silicon Graphics IRIS 4D/35G). The remaining structures were then rank 
ordered by solvent-accessible surface contact area, a measure of the validiQr 

15 of model structures. The most compact structures were right handed, all anti- 
parallel four-helix bundles with no overiiand connections, but this may be an 
artifact of a feUuie to add the polypeptide chain that forms the Ioq)s to the 
helical core constructed by aabiuld. The other less con^ structures were 
left^ianded four-hdut bundles wiA two overhand kx^s, a tqwlogy previously 

20 seep in the structuies of IL4 and growUi hormone. We suqiect that this is the 
likety stnoturc for Epo, The consensus for assignments of putative a-hclices 
in buman Epo are summarized in Table III. First, analysis of the topological 
distribution of known four-helix bundle structuies indicate that nearly all 
examples have an antiparallel orientation (Presndl. S.R., et cL, Pmc. Natl. 

25 Acad. Sd. 56:6592-«596 (1989)). Second, die lefl-handed fbur-belix bundles 
wtdi two overiiand connections arranges die four anqihipaifaic helices to form 
a compact hydrophobic core. Finally, a padi for die Io(q> regkms of ^ can 
be imagined by analogy to IL-4 and growdi hormone diat preserves the 
compact globular nature of die Epo model structure. Figure 9 shows 
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schematic representations of the possible topological interactions between the 



four-anti-parallel a-helical bundles. 







Table III 




Predicted a-helicai regions of the mature erythropoietin protdn 


Helix 


N-terminus 


C-terminus 


Potential Hdix-Helix 
;lotencd(» Sites 


A 


9 


22 


19 


B 


59 


76 


63. 67. 70. 71 


C 


90 


107 


95. 102 


D 


132 


152 


141 


Data were obtained using the various algorithms for secondary and 
tertiary structure generations described under Materials and Methods. 



Several authors have suggested that the helical cytoldnes form a 
structural superfamily (Cohen, RE., etal., Sdence 254:349-352 (1986); 
Pany, D.A,D., etd,. J. M0I Recognition J: 107-1 10 (1988); Bazan. J.F., 

15 Biochem. Biophys. Res. Comm. 164:10-195 (1989); CJosman. D., etal.. 
Trends Biochem. Sd. 75:265-270 (1990); Bazan. J.F., Proc. Natl. Acad. Sd. 
U.SA. 87:6934-6938 (1990)). On the basis of both die mature protein and die 
individual a^elioes, Epo seems to be more closely related to growth hormone, 
prolactin, llr6 and GM-CSF radier dian the other members of die helical 

20 cy&)Idne super&mily. Nevertheless, recent in^rovements in algoridims for 
die identification of distant evolutionary relationships between proteins from 
structural fingerprints suggested that it might be possible to align the IL-4 
structure to die Epo sequences. The Esenberg (Bowie, J.U., et al., 
Sdence 253:16^-110 (1991)) stnicmral environment and 3D-1D profile 

25 mediods are a powerfiil tool for recognizing diat a sequence is conqiatible widi 
a known structure, e.g., a four-helbc bundle. The NMR structure of IL4 
from Smidi, LJ., eial.. J. Mol Biol. 224:899-904 (1992) was used to 
construct a 3D-1D profile. A mixture of sequences including four helut 
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bundles, globins, and non-helical structures were aligned against the IL4 
profile. Not surprisingly, the IL-4 structures from human and mouse gave the 
highest scores (2? = 22.8 and 8.1). However, the other known four-helix 
bundle cytokines known to share a similar fold with IL4, c.g.. human growth 

5 hormone (Abdel-Meguid. S.S., et aL, Proc. Nail. Acad. Sd. 8^:6434-6437 
(1987)) (Z = 2.3) and GM-CSF (Diederichs, K., etal., Science 
254:1779-1782 (1991)) (Z = 2.3) fared no better than some globin sequences 
(Kuroda's and slug sea hare globin. Z = 5.0 and 4.8) that adopt a distinct 
tertiary structure. The results for the human and sheep Epo sequences were 

10 also ambiguous (Z = 1.6 and 0.8). These results suggest that while profile 
methods are a powerful tool for recognizing structural similarity, their fwlure 
to identify homology does not exclude the possibility that two proteins share 
a common fold. For distantly related or unrelated structures, current profile 
methods cannot r^lace de novo methods for tertiary structure prediction. 

15 DedgnandExpnssionofEpoMuteinsthatTeaAePmposedStnscttt^^ 

To test the proposed four a-helical bundle structure of erythropoietin 
and at the same time to attempt to locate functional domains, we created by 
site-directed mutagenesis a series of deletion, insertion and replaoement 
mutants. These muteins were designed to analyze the prin^al predicted 
20 stnictural feamres of die molecule: a-helices, interconnecting loq)s, as well 
as the NHj and CCX)H termini. Structural and functional implications of die 
disulfide bridges aiMl die glycosylation sites were also investigated. 

a Helices. Short amino acid deletions were prq)ared in, or close to, 
die predicted A, B, C and D a helices. Human wild typt and muteins were 



25 ^scores are used to describe die normalized weight associated widi a 
profile score. A distribution is built from a collection of sequences with a 
mean 2 score of 0.0 and a standard deviation of l.O. Z-scorcs greater dian 
6.0 arc associated with significant alignments. Z*scores between 3.0 and 6.0 
may or may not be structurally relevant. 
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transiently expressed in Cos? cells. Northern blot analyses demonstrated that 
all the mutant plasmids produced about the same amount of mRNA as that of 
the wild type (data not shown). Yet, no detectable amount of Epo protein 
could be found in the Cos? supematants, either by radioimmunoassay or by 
bioassay usitig various Epo-dependent cell lines. Table IV summarizes these 
findiiigs. 

TABLE IV 
Shon ddedons in, or close to, a helices 

mutants. 
A 12-16 



A 65-69 



A 96-100 
A 105-109 



A 122-126 

A 131-135 
A 140-144 
A 142-150 
A 152-155 

A 156-160 



9 
I 

22 



59 
4 

76 



90 
1 

107 



A HELK 



B HELK 



C HELIX 



132 
152 



D HEUX 



- RNA levels conq)aiable to WT. 

- no detectable Epo in the Cos? 
supernatant, both by RIA and 
bioassay. 



Predicted N and C tennioi of each a helix are indicated in the vertical boxes. 



An example of SDS-PAGE of immunopiedpitants from in. vivo ^S- 
labeliiig is presented in Figure 10. As eqiected, ^n Cos? odls were 
transacted with i>SG5-Ei)o/WT, a 35-37 KD band was detected in the 
supernatant In contrast, the deletion mutants (Table TV) could be detected in 
5 cellular extracts but were not nqraited from the cells. Figure 10 shows the 
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cytoplasmic retention of the mutein A140-144, lacking four residues in the 
middle of the predicted D helix. The apparent molecular weight (around 28 
IcD) is less than expected for a five-amino acid deletion. Therefore not only 
the secretion, but also the glycosylation, seem to be impaired. None of these 

5 mutcins had deletion of glycosylation sites. It is likely that fiill glycosylation 
of Epo requires conservation of its molecular architecture. Similar resulQ 
(reported in Table IV) were obtaned for all the muteins having partial deletion- 
of an o helical pq>tide segment. 

Because contaminants in crude Cos? cellular extracts severely interfere 

10 with the radioimmunoassay, no direct Epo quantitation was possible. . 
However, aliquots of hypotonic extracts of Cos? transfected witii wild type 
Epo were able to sijstain HDC5? proliferation. No similar biological activity 
was found for muteins wridi limited deletion of a helices. 

Ittterconnecdng Loops. The peptide segment joining A and B helices 

15 presents several interesting features. (Figure 11). AB loop consists of 36 
amino acids. TWO N-glycosylation sites and a anall disulfide bridge are 
located in die first half and their biological implications will be discussed later. 
The COOH end of tiie AB loop contains a stretch of amino acids that is 
strongly conserved among mammals (Sec Example I). Alignments of human, 

20 monkeys, cat, mouse, lat, pig, and sheep Epos showed a consensus sequence: 
DTKVNFYAWKR(M/I)(E/D)VG (residues 43 to 57) [SEQ. ID NO: 4J. 
Three deletions were constructed: A43-47, A48-52, and A53-5?, and 
transiendy expressed in Cos?. The amount of muteins detected by RIA in tiie 
supemaiants of oansfixted cells was 10 to 40% lower tiian observed wid» wild 

25 ^ Epo (Figure 11 A). Nevertiieless, flie fliree secreted mutcins were 
biologically active. HowevM-, because A48-52 exhibited a marked decrease 
of die specific bioactivity, diis site was snidied in more detail by means of 
serine replacements. KrystaJ a w w bioassay as well as HCD5? and 
VT'inepo in }4tm bioassays showed diat diese Ser mutants had biological 

30 acti^ties similar to tiiat of mid type (Figure IIB). Therefore, the observed 
decreases in bodi RIA and bioassay for the diree deletion mutants are. likely 
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to be the result of changes of structural conformation. The long length of loop 
AB may be critical for the up-up-down-d&wn topography. A shorter AB 
segment may impose a strain on the intcrhelical connection. Chou and 
Fasman algorithms (Chou. P-Y:. et al., Ann. Rev. Biochm. 47:251-276 

5 (1978)) predicted a short ^-shect structure from residues 44 to 51 (<Pa> ; 
<Pb>. 1.005 < 1.196). The presence of a short region of 0-sheet in the 
connection between helices 1 and 2 (A and B) have been documented in the- 
analyses of the three-dimensional structures of IL4 (Redfield, C. a dl.. 
Biochemistry 30: 1 1029-1 1045 (1991); Powers. R.. et al., Science 256:1673- 

10 1677 (1992)), GM-CSF (Diederichs; K., etal.. Science 254:Vn9-m2 

(1991) ). and monomeric M-CSF (Pandit. J., et al., Stfemre 25& 1358-1362 

(1992) ). In contrast, in human GH a short segment of a hela is found at the 
same location (Abdel-Meguid. S.S.. etal.. Proc. Natl. Acad. Sd. USA 
54:6434-6437 (1987)). The structure/function implications of tiiese short 

15 fsatures are not yet understood. 

Helix B is linked to helix C by a much shorter segment (residues 77 
to 89) and contains in its center the tiurd N-glycosylation site (Asn83). When 
the A78-82 mutein was expressed, a secreted protein was detected in the 
conditioned medium and conferred proliferative bioactiviiy on Epo^ependent 

20 cell lines (see Figure 16). 

.A sinular long crossover connection (23 amino adds) is found between 
helix C and helix D. In contrasiio what we previously observed for loop AB, 

a iarge deletion of mne residues at position 111-119 or a seven amino acid 
inseitioa of a myc epitope after residue 116 did not affect the secretion of 

25 diese muteins (Figure 12). Furthermore, these two proteins had normal 
specific activity, as seen by the ratio of bioassay to RIA. Our rabbit 
polyclonal antibody raised against the native form of the human wild type fully 
tecognized flie two mutants, demonstrating that tiie overall spatial 
conformation of Epo was weU preserved. According to >e algorithm of 

30 Emini. E.A., et aL. J. Virol. 55:836-839 (1985). the residues 111-119 are 
predicted to be at flie surface of die molecule. Primary amino add alignments 
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of mammalian Epo showed a large variation in the sequence of residues 1 16 
to 130. including amino acid deletion, insertion, and substitution (See 
Example!). Surprisingly, when the deletion A122-126 mutein. which , 
removed the 0-glycosylation site (Ser 126). was tiansiendy expressed in 

5 tnonicey cells, protein secretion was inhibited. Botii rodents, rat and mouse, 
lack dje O-glycosylation site because of a Serl26 to Pro replacement. 
Furthermore, when a ScrI26 replacemem mutein was expressed in normal • 
CHO cells. (DeLorme. E., etaL. Biochendstry iJ/:9871-9876 (1992)) or 
when wild type Epo was expressed in cells having a defect in 0-Iinked 

10 glycosylation (Wasley. L.C., etal., Blood 77:2624-2632 (1991)). neither 
secretion nor biological activity were impaired. Therefore, failure of secretion 
of the A122-126 mutein may be the result of some other structural alteration. 
In particular, the proline residue at position 122 is invariant among mammals. 
NandC TerminL Deletion of residues 2 to 6 only slightiy affected 

15 tije processing of a biologically active protein (see Figure 16). This deletion 
may impair cleavage of the propeptide, therefore explaining the lower yield 
of secreted Epo mutein in comparison to diat of wild type. The fact Aat the 
mature monkey protein has an elongated (Val-Pro-Gly) N-terminu$ strongly 
suggests tiiat the N-terminal part is not involved in the bioactivity of the 

20 molecule. Further evidence comes from ti\e results, repotted below, on the 
N-poly-His-^ fusion prwein expressed in £. eoff, and also from tiie idendcal 
binding of /« vitro translated »S-labeled wild type Epo onto EREX-glutadiione 
' agarose beads (Figure 13), witii or widiout addition of canine pancreatic 
microsomal membranes which permit cleavage of the propeptide.' 

25 The C-terminal sequence foUovwng helix D can dearly be divided into 

two distinct domains, separated by Cysl61. The residues 151 to 161 were of 
^al interest because tiiqr are highly conserved among mammals (See 
Example I). There are only two substitutions: Lysl64 replaced by a Thr in 



30 



?All die mutants described in this example were subcloned into pSPG4T 
plasmid. 
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aruodactyls and cat, and AIal60 is replaced by a Val in mouse Epo. Both the 
A152-155 and the A156-160 mutcins remained in the cytosol of the transfected 
Cos? (Table IV). One possible explanation is that the residues 152 to 160 
may, in feet, participate in the D helix. We predict that Glyl51 is the break 

5 point of the stnicture. However, it is possible that this residue causes only a 
bend in the a helical structure aiui helix D may extend to Gly 158. 

The C-ierminal part of the protein (residues 162 to 166) is clearly not- 
involved in any structural or functional feature. Thus, the deletion of tiie four 
last amino acids or die replacement of residues 162-166 by eidier a KDEL 

10 sequence or a poly-histidine sequence* did not modify tiie specific activity of 
the erythropoietin (Figure 14). Radioimmunoassay revealed fliat tiie secretion 
of the KDEL-tail mutcin in the rnedia of transfected cells was 45% less dian 
normally obtained with tiie wild type Epo. However, when compared to tiie 
wild type, tiiis mulein had more biological activity in tiie hypotonic Cos? cell 

15 extracts. The KDEL (XX)H-tenninal sequence has been riiown to be essential 
for tiie retention of several proteins in tiie lumen of tiie endoplasmic reticulum 
(Andres, D.A.. efo/., J. Biol. Oim. 266:14288-14282 (1991)). 
Neverdieless, because of overproduction in transientiy expressed cells, a large 
percentage of recombinant protein escaped into tiie media. 

20 IXsi4fide Bridges. Warig aal. demonstrated tiiat tiie biological 

actiyiiy of Bpo was Jost irreversibly if tiie sulfliydryl groups were alkylated 
(Wang, F.F.. et al.. Endocrinology iJ6:2286-2292 (1985)). The mature 
human ^ has two internal disulfide bonds: Cys7sCysl61, linking tiie NH, 
and CpOH termini of tiie protein and a small bridge between Cy^9 aiul 

25 Cys33. Cy^3 was previously cfaaiiged to Pro by site-directed mutagenesis, 
and tiie resulting protein was reported to have greatiy reduced in yiiro 
biological activity (Un, F.-K., Molecular md Cettidar Aspects (ff 



^Tbt poly-His tail wild type mutant was purified by means of nidcel 
affinity diromatography whidi will enable quantitation of cytosolic-retained 
30 mutants. The (His)6 COOH-terminal sequence has been appended to all tiie 
mutdns described in tiiis example. 
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Erythropoietin and Erythropoiesis H8:23-36, NATO ASl Series. Springer- 
Verlag. Berlin Hcidelbei;g (1987)). However, rat and mouse Epos have the 
same substitution and yet exhibit full cross-species bioactivity. To resolve the 
role of the small disulfide bridge in human Epo function, we created a 

5 C29Y/C33Y double mutation. The resulting muttin was normally processed 
and showed the same in wtro bioactivity as tfie wild type (Figure 11 A).' 
Furtiiermore. die deletion of five amino acid residues (A32-36) did not impair 
die secretion of a biologically active mutein. These data demonstrated tiiat only 
die native and fully conserved disulfide bridge Cys7=?Cysl61 is crucial for the 

10 preservation of die molecular structure of erydirqwietin. 

Functional Role of the Glycosyladon. Natural Or recombinant human 
^0 is a heavily glycosylated protein; 40% of its molecular weight is sugars 
(Davis. J.M.. et al.. Biochemistry 26:2633-2638 (1987)). TTic protein has 
tiitee N-linked oligosaccharide chains, located at amino acid positions 24 

15 and 38 On predicted loop AB) and position 83 (in loop BQ. It has one O- 
linked carbohydrate chain at position 126 (in loop CD), wWdi is missiag in 
rodents. The role of tiiese sugar chains in dw biological activity of die human 
hormone has been extensively studied. Site-directed mutagenesis at the N- 
. giycosylation sites demonstrated tiiat even tiiough die sugars were impoitant 

20 for proper biosyndiesis and secretion, their removal did not affiect in vitro 
activity. This finding was corroborated by several investigators (Yamagudu, 
K.. et al., J. Biol. Chem. 266:20434-20439 (1991); Dub6, S., et ai, J. Biol, 
dm. 263:17516-17526 (1988)). However, Takeucbl, U.,etal., J. Biol. 
Chem. 265:12127-12130 (1990) found tiiat N-glycanase digestion results in 

25 almost complete loss of biological activity. In contrast, tiiere is general 
consensus diat giycosylation plays a key role in die biological activity of die 
hormone in w w. Various rqwrts have demonstrated diat die N-Iinked sugar 
chains«nhance die stability and sundval of 1^ in die blood stream (Fukuda, 



^wo single iq)lacement muteins (C29Y and C33Y) were also stable and 
30 had full biological activity (results not timm). 
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M.N.. aal.. Blood 75:84-89 (1989); Tsuda. E., etal.. Eur. J. Biochem. 
iSS:405411 (1990)) and protect the hormone aganst clearance by the liver 
(Sasaki. H.. et al., J. Biol. Oiem. 262:12059-12076 (1987)). thereby enabling 
the transit of the hormone from its site of production in the kidney to its target 

5 cells in the bone marrow (Spivak. J.L.. et al.. Blood 73:90-99 (1989)). 

We expressed the wild type Epo in £. coU. Accordingly, the produced 
protein completely lacks sugar. The pET expression system was used and is 
detailed in the Methods section. IPTG induction of transformed 8L21 (DE3) 
bacterial strain rapidly results in a high level of expression of the poly-His Epo 

10 fusion protein (Figure 15A and 15B). After three hours of induction, we 
obtained a typical yield of -1 mg of total protein per ml of culture. 
However, the vast majority of the expressed protein was presem in the 
inclusion bodies and tiierefore its solubilization and purification on the nickel 
beads was performed in 6 M guanidine-HCl. Oxidative refolding and factor 

15 Xa cleavage resulted in soluble forms (Figure 15C) and the in vitro biological 
activity was tested on HDC57 cells. The cleaved E. coU recombinant Epo 
showed a notable decrease of Uie specific activity (10% less than tfie fully 
glycosylated mammalian expressed protein), but was still able to maintain 
HCDS7 proliferation. The observed reduction of in vitro activity is likely to 

20 be due to impioper refolding of the insoluble protein and impaired physical 
stability of the R coili Epo as previously reported (Narhi, L.O., a al., J. Biol, 
aim. 266:23022-73026 (1991)). However, the fact diat the £. <»ff Epo was 
still able to trigger HCD57 growA indicated an overall preservation of die 
molecdlar structure. The undeaved fusion protein, widi 10 His residues at the 

25 N-terminus. exhibited a 67% loss in biological activity when coinpared to the 
cleaved piotdn. Thus, the addition of a 20-icsidue sequence to the N- 
tnmiinis partially inhibited die biological activity. 
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Discussion 

Currently the accrual rate of new protein sequences through gene 
cloning for outstrips the rate of determination of three dimensional structure. 
Epo is among a large number of biologically important proteins which have 
5 not yet been analyzed by X-ray diffraction or NMR, The problem is simplified 
by cumulative evidence that the structures of most proteins are likely to be* 
variations on existing themes (Uvitt, M., et aL Nature 267:552-558 (1976)). 
Indeed, as mentioned above, Epo appears to share common structural features 
with a large group of cytokines (Parry, D.A.D., etd., J. MoL 

10 Recognition 7:107-110 (1988); Bazan, R, Immunology Today J7:350-354 
(1990); Manavalan, P.. et dL i. Protein Ckem. 77:321^331 (1992)). 

Computer-based prediction of strucnire can be reduced to a three stage 
process: secondary structure is predicted from the primary amino acid 
sequence and, when available, optical measurements. Analysis of Epo by 

15 circular dichroism reveals about 50% a helix and no detectable /J sheet (Lai. 
P.-H., etal., J. Biol Chem. 267:3116-3121 (1986); Davis, J.M., et al., 
Biochemistry 25:2633-2638 (1987)). With the knowledge of disulfide bonds, 
secondary structural elements are then packed into a set of alternative tertiary 
structures- The number of plausible arrangements can be reduced by 

20 empirical knowledge of preferred helix-helfat paddnig geometries and the need 
for globular structure to form a hydrophobic core. The putative tertiary 
structure is then refined by standard force field calculations. Since tiiere are 
a large number of alternate tertiary structures, tiie availabiliQr of 
e3q>erimentally determined structure of a homologous protein is critically 

25 iTsxponanL Thus tfie predicted model of Epo structure gains considerable 
validity by knowledge of die strucnires of GH (Abdel-Meguid, er al., 
hoc. Natl. Acad. Sd. USA W:6434^37 (1987); deVos. A.M.. eta!.. 
Science 255:306-312 (1992)) and llA (Redfield, C, etal.. 
Biochemistry 30:11029-11045 (1991); Powers. R., et al.. 

30 Mencf 256:1673-1677 (1992)). 
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We have tested the predicted four anti-parallel o-helical bundle 
structure by means of site-directed mutagenesis. Deletions within predicted 
o helices would be expected to destabilize tertiary structure, whereas deletions 
or insenions in non-helical segments should be permitted unless they impose. 

5 undue strain on Uic structure. For example, a deletion in an overhand inter- 
helical loop may result in insufficient length to connect the two helices. 
Results that we have obtained on muteins produced in mammalian (Cos?) cells- 
are summarized in Figure 16. Our measurements of the quantities of 
processed mutein by RIA may underestimate the true amount of secreted Epo. 

10 Even a small deletion or insertion can result in a conformational change diat 
may Ifcad to impaired binding by our polyclonal antibody, raised against native 
human Epo. Thus the values for specific activity (biologic acuvity/RlA) diat 
we report must be regarded as approximations. This caveat notwidistanding. 
our mutagenesis results are in good agreement witii our proposed four 

15 o-hcUcal model of crytiwopoietin. The proper folding of Epo into its native 
tertiary structure is necessary for stability and biological function. Muteins 
widi short deletions inside predicted a-helices were not processed and exhibited 
no biological activity, in contrast, when deletions were created in predicted 
inteiconnectiug loops, secreted proteins were detected, to varying degrees. 

20 bodi by radioimmunoassay and bioassay. Ftmhennore. additions or deletions 
at dtt N- or C-tennini did not markedly impair the secretion and the biologiiBal 
activity of fltt Epo protein^Moreover. mutations at Cys29 and CyS33 dwwed 

tiiat the small disulfide loop U not critical for biological activity. In order to 
delineate fiinctionally important leadues invcrfved in Ae direct binding 
25 onto die E^w receptor, we have prepared and tested a series of amino acid 
replacements on tiicsurfiaces of die predicted o helices. These experiments 
are desmbed in a Example HI. 
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EXAMPLEIII 



Erythropoietin Structure-Function Relationships; 
Mutelns That Test Functionally Important Domabis 



Mmduction 



The Epo Teceptor is a member of an ever enlarging family of cytoldne • 
receptors (Bazan. F.. Biochm. Biophys, Res. Commun. ;W:788.796 (1989)). 
m like manner. Epo shares wealc sequence homology with other members of 
a family of cytokines which also include growth hormone, prolactin. IL-l, 
IL-3. IW. IL-5. IL-6. IL-7. G-CSF. GM-CSF. M-CSF. oncostatin M, 
leukemia inhibitory factor and ciliary neurotrophic fectpr (Pany, p.A.D. 
etal., J. MoL Recognition i:107-110 (1988); Bazan F., Immmology 
Today iJ:350-354 (1990); Manavalan, P. et al.,J. Protdn Otem. iJ:321-331 
(1992)). The genes encoding these proteins have similar numbers of erons as 
well as a clear relationship between imron-exon boundaries and predicted o- 
15 helical bundles. Such tiieoretical models of the strucnires of human growth 
hormone (Cohen. F.E. etal., Protdhs: Struct. Fim. Genet. 2:162-166 
(1987)) and \\A (Curtis. B.M. etd.. Proteins: Struct. Func. Genet. 
ii:lll-119 (1991)) have been in remarkably good agreement with subsequent 
stiuctnies established by X-ray diffraction (HGH) (Abdel-Meguid. S.S. et al., 
20 Proc. Natl. Acad. Sd. USA W:6434-6437 (1987); de Vos. A.M. et al., 
Sdence 255:306-312 (1992)) or by multidimensional NMR flL-4) (Redfield, 
C. etd., BlochemistTy 3(^:11029-11035 (1991); Powers. R. etal., 
Sdence 256:1673-16n (1992)). Moreover, the crystal structures of GM-CSF 
(Diedciichs. K.«r &fe«ce 25^: 1779-1782 (1991)) and monomeric M-CSF 

25 (Pandit. J. etal., Sdence 25&1358-1362 (1992)) are also to reasonable 
agreement vAtti their predicted structures. 

The stmcnire of Epo has not yet been analyzed by either X-ray 
diffraction or by NMR. In order to begin lo gain an understanding of 
struciuie-limction rebtionships. we have taken a three-pronged approach: 
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a) Sequence determination of Epo from mammals of different 
orders in order to establish regions of homology. This work is 
described in Example I. 

b) Construction of a modej of the three-dimensional stnicture of 
S Epo, followed by the design and preparation of muteins that test 

this model. These experiments are presented in Example II; 

c) Design and testing of muteins that provide information on • 
receptor binding domain(s). This work is presented in this. 
Example. 

10 MATERIALS AND METHODS 

Construction of the Mutagenic/Mammalian Expression Plasmid. A 
M13 plasmid, containing a 1.4 Kb EcoRl-EcdSl human Epo cDNA insert 
QJIEpo FL12) was a gift from Genetics Institute (Cambridge, MA) (Jacobs. 
K. etal.. Nature JJi:806-810 (1985)). A 943 bp EcoKL-Bgin fragment, 
15 correqwnding to die complete coding sequence of the wild-Qfpe human 
eiyduq>oietin, including untranslated regions 216 bp upstream and 183 bp 
downstream, wi^ inserted into the mammalian expression plasmid pSG5 
(Sttatagene ) (Sambrook. J. et al.. In "Molecular Cloning: A Laboratory 
manual*. 2nd ed. Cold Spring Haibor. Laboratory. Cold Spring Haibor, 
20 N.Y., vol. l.ppa.38-1.41 (1989)) and named pSG5.^)^ 

SUe-JXrwted Mutegene^ ym earned out according to die protocol 
described by Konkd. T.A. etal., Meth. Eiaymbl. 154:3SJ'i22 (1987). 
Single-sttanded DNA was resoied from ihe pSG5-Epo/WT phagemid grown 
overnight in£. cofi 0236. in 2XYT media containing M13K07 helper phage 
25 0n NTitiogen) and 70 Mg/nl kanamat^ (Sigma. St Louis^ Mo). The resulting 
uiadl-contmning ssDNA was used as a tenq>late for mutagenesis. 
Oligonudeotides OA to 46 mer) were syndiesized vradi their 5' and 3' ends 
conq)lementary to die target i«dld-type Epo sequmoe. A large varieQr of 
mutations (base substitutions, deletions and insertions) were oeated at the 
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centersof^emutagenicprimerscquences. Annealing of the phosphorylated 

primers (10:1 oligonucleotide/DNA template molecular ratio) was performed 
in 10 Ml of a 20 mM Tris-HCL pH 7.4. 2 mM MgCl. 50 mM NaCl solution. 

The reactions we.^ incubated at 80^C for 5 minutes, then allowed to cool 
slowly to room temperature overaonehour period. ThcDNApolymerization 

was initiated by the addition as a mix of I of lOX synthesis buffer (100 mM 
Tris-HCl pH 7.4 50. mM MgCI. 10 mM ATP. 5 mM each dNTPs. 20 mM- 
DTD 0.5 Ml (8 units) of T4 DNA Ugase and 1 ^1 d unit) of T4 DNA 
polymerase (Boehringer Mannheim). Afu^r 2 hours at 37«C. 80 ^1 of IX TE 
was added. 5 m1 of the diluted reaction mix was used to transform competent 
£.«)ffNM522(ung*,dut*). 

Since a 40-80% muation yield is normally obtained. 4 to 5 double- 
stranded plasmid clones from each reaction were sequenced wiU, 7-deaza- 
dCrrP and Sequenase (U.S. Biochemical) (Sanger. Ketal., Proc. NatL Acad. 
15 Sd 104 7*5463-5467(1977)). As a rule, the entire coding sequences of tfu^ 
Epo mutams were examined for the presence of unwanted mutation by 
sequencing or restriction eniyme mapping. 

Pnductbn of WU-TSpe and Epo Mutms in MammaUan Cells. 
Cos7 cells grown to approx. 70« confluency were transfccted with 10 ME 
20 lecombinam plasmid DNA per 10 cm dish using the calcium phosphate 
piedpitation protocol (Kingsttm. R.E. In "Currem Protocols in 
Molecular Biology-, Green Ptibl. Assoc AWiley Interscience. N.Y.. vol. 1. 
pp. 9.1.1-9.1.9 (1991)). As a control of transCwaion efficiency, In several 
experiments 2 of pCHllO plasmid (Pharmacia) was oo-tt^^ 
25 galactosidase activity measured In Ac cytoplasmic extracts 

m Blot'Hybiidbfiiion Analysis. Total RNAs were prepared from 
culmredCos7celU(airgwin.J.M.etal..ffochentf^ 
and 2 MB samples electrophoresed on a 1.1% agarose gd oonoimng 2.2 M 
formaldehyde. Transfer to GeneScreen Plus filters (New England Nuclear) 
and hybridization wifli a «P-labeled wt Epo probe were carried out as 
previously described (Faquin. W.C. ef a/.. Blood 79:1987-1994 (1992)). 



30 
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QuantUatbn of TmasUatly-Exprmed Recombinant Epos. The 
amount of secreted protein in the supematants of tiansfected Cos? v«s 
determined by a tadioimmune assay (RIA). THe RIA was performed using a 
high titer rabbit polyclonal antiserum raised against the human wlld-t^ 

5 and produced in our laboratory. '"Mabeled recombinant Epo was obtained 
from Amersham. Details of the protocol have been published elsewhere 
(Faquin. W.C. et ai. Exp. Hematot. 21: 42(M26 (1993)). 

Bioassays. The dose-dependent proliferation activities of wt and Epo 
muteins were assayed m WW using tiiree different target cells: murine splien 

10 cdU, following a modification of die raeUiod of Krystal (Krystal. C. Exp. 
HmatoL ii:649-660 (1983); Goldberg. M.A. et ai. Proc. Natl. Acad. Sd. 
USA «:7972-7976 (1987)); murine Epo responsive murine cell line. 
develop«i by. Hankins (Hankins, W.D. et ai. Blood 7ftl73a (1987)); and 
htiman Epo^iependem UT-7/Epo cell line, derived from die bone marrow of 

15 a patient witit acute megakaryoblastic leukemia (Komatsu. N « a/.. Cancer 
Res. 57:341-348 (1991)). After 22 to 72 hours of incubation witii increasing 
amounts of recombinant proteins, cdlutar growtii was determined by pH]- 
tiiymidine (New England Nuclear) uptake or by tiie colorimetric MTT assay 

(Sigma) (Ydshimuia. A. etal., Proc. natl. Acad. Sd. USA «7:4139^143 

20 (1990). 

RESULTS AND DISCUSSION 

Our objective » to identify the fttnctionally important doniains on ti« 
Epo molecule by preparing muteins tiat have altered specific activity. i.e. 
significandy high or low biological activity per unit mass of protein. Such 
25 muteins must satisfy tiie following criteria: effictcni secretion by Cos? cells; 
full recognition (near normal binding affinity) by flte polyclonal anti-Epo 
antiserum tfuit we use in our radio-immune assay; preservation of flie overall 
folding and ditee dimensional structure. In order to meet diese criteria, die 
muteins tiat we employ in tins snidy have only single amino add 
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replacements. A comprehensive amino acid replacement scan of Epo would 
of necessity require the preparation and testing of a minimum of 166 mutelns. 
Such an undertaking would be prohibitively labor intensive and costly. 

In Older to dewse a radonal strategy to reduce fte number of required 

5 replacements, we have taken advantage of information we have obtained 
employing scanning deletion muteins (See Example II). On the basis of its 
primary amino acid sequence and disulfide bonds Epo, in common with other • 
members of the cytokine family is predicted to have a four antipaiallel 
amphipathic o-helical bundle structure. We have shown that deletions in aon- 

10 helical regions at the N-terminus, the G-terminus and in loops between helices 
result in the formation of protein that is readily secreted from the cell and is 
biologically active. These regions can be ruled out as functionally important 
domains such as the sites involved in the binding of ^ to its receptt>r. These 
results suggest thai the functionally important contact sites are likely to reside 

15 on die predicted a helices. Furthermore, since the amphipathic helices are 
predicted to bind to one another along their hydrophobic sur&ces. Ae 
biologically relevant contact sites are likely to be readues predicted to be on 
the external surfaces of the helices. These considerations greatly restrict the 
number of mutations needed to provide a comprehensive and informative body 

20 of data. 

• Because structure-function studies on growdi hormone and ILr4 liave 
indicated fliat Helices A and D are involved in recq>tor l^inding, most of our 
muteins have been directed to these regions. 

Results on Helix A muteins are shown in Figure 17. The top panel 

25 shows a helical wheel projection. Replacemem by alanine of die predicted 
external residues 13, 18, 20 and 21 have no significant impact on biological 
acdvi^ as detennined by the relative incoiporation of pH]-tfiymidine into Epo 
dq)endent HCD57 celk, Howevw, die rq)lacement of Aigl4 by Ala diowed 
maricedly impaired biolo^cal activiQr. 

30 Figure 18, top panel, shows a helical v^l piojection for Helix D. 

Nine of die nine predicted external residues were replaced by alaiune. 
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Mutations at positions 136. 139 and 140 gave rise to proteins that >vere 
efficiently secreted and had fiill biological acUvity. (Figure 18. bottom). In 
contrast Ala replacement of either Lysl40. Argl43. Scrl46. Asnl47 or 
Lysl54 resulted in muteins with significandy (S-foW) Increased biological 
5 activity. Replacement of Asnl47 with Ala resulted in die greatest increase in 
biological activity. (Figures 23-25). 

Because die C-terminal boundary of Helix D is uncertain, we tested • 
alanine replacements at positions 152, 154 and 156 which are predicted to be 
in the adjacent non-helical segment at die C-terminus. As shown in Figure 19. 
10 the Ala replacement at Lysl54 had slightly increased biological activity while 
die Ala replacement at Tyrl56 had slighdy decreased biological activity. 

The experiments reported above all involve testing die function of 
huroanEpomutcinsontiicmouseEporcsponsiveHCD57celHine. Figure 20 
shows a comparative display of those muteins diat appear to have Mghsi 

15 biological activity than wld-type 

In oiderto assess die significance of diese results we have also tested 

ihsx muteins in a inore physiologic system, mouse erytiiroid progenitors 
prepared from die spleens of animals challenged witii phenylhydrazine. This 
bioassay obviates die possible confounding effects of a malignant clonal cell 

20 line such as HCD57 which was derived fhOT mouse erytiiioleukemia Vitus. 

. As shown in Figuie 21, results wiflictyduold cells ftom mouse spleen 
were very similar to those obtained widi HCDS7 cells. 

In addition we have tested diesc muteins in an. Epo lesponsive human 
ceU line UT-7/Epo, derived ftom apaiiem widi acute leukemia. This ceU line 

25 provides die opportunity of examining die interaction of human muteins wifli 
the human Epo leceptor. As shown of ordinary in Figure 22. tiiese results 
closely parallel diose widi die two mouse cell bioassays. 

It should be noted tiiat in several of die dose response curves, at du: 
highest concentrations of Epo tested, diere is a significant drop-off in die 

30 incorporation of pHl-diymidine. This is due to die fiict diat high levels of 
biologically active Epo causes terminal maturation and cessation of division. 
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Because of this possibly confounding phenomenon, it is imponant to have, as 
in the experiments that we report here, a full dose lesponse curve for each 
mutein with an adequate number of data points. 

Taken together our results show that a restricted number of residues on 

5 Helix A and on Helix D are involved in Epo's biological activity. Further 
experiments are in progress to determine if the observed alterations in 
biological activity can be explained by parallel changes in the binding affimty " 
to 4e Epo receptor. In addition, at Ae sites that we have identified as 
involved in Epo's biological activity, we plan to make a set of additional 

10 replacement muteins in order to further refine the ligand-receptor interaction. 
Weareparticularly interested in developing muteins with maximally increased 
biological activity, e.g. by preparing compound or multiplex replacements, 
each of which confers optimal enhancemem of biological activity. Such a 
-superEpo" could have improved utility in the dierapy of anemias. 

Condttsions 

We have produced a number of erythropoietin muteins comprising 
• «ttivc erythropoietin with one or more modifications whidi improves ifae 
biolo^cal activity. 

• AUiefcreiicesmentionedhereinareincorpoiatedbyieferenceintoaus 
20 di9;losuie. 

Having now fiilly described the Invention by way of illustration and 
example for purposes of clarity and undemanding, it will be appaient lo tbbse 
of ordinary skill in the art that ccitain changes and modlficaiions may be 
practiced witiun tiie scope of tiie invention, as limited only by the following 
25 claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Brigham and Vtomen'a Hoopital 
75 Francis Street 
Boston, MA 02115 

INVENTORS: Boisael, Jean- Paul R. 
Bunn. H. Franklin 
Wen, Danyi 
Showers, Mark 0. 

(ii) TITLE OF INVENTION: Erythropoietin Muteine With Enhanced 
Activity 

(iii) NUMBER OF SEQUENCES: 59 

(iv) CORRESPONDENCE ADDRESS: ,^ . , ■„ 

(A) ADDRESSEE: Sterne, Keseler, Goldetein k Fox 

(B) STREET; 1100 New York Avenue, Suite 600 

(C) CITY: Washington 
<D) STATE: D.C. 

<E) COUNTRY: U.S.A. 
(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version ffl«2d 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (to be assigned) 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: . 

(A) APPLICATION NUMBER: 08/049,802 
<B) FILING DATE: 21-APR-1993 

iC) CLASSIFICATION 1 

(viii) ATTORNEY/AGENT INFORMATION: 

' (A) NAME: Cimbala, Michele A. 

(B) REGISTRATION NUMBER: 33,851 

(C) REFERENCE/DOCKET NUMBER: 0627,336PC01 

(ix)' TBLEOCaWUNI CATION INFORMATICMI: 

(A) TELEPHONE: (202) 371-2600 

(B) TELEFAX: (202) 371-2540 

(2) INFORMATION FOR 8BQ ID HOjI: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATWGAATGAA SGC 

(2) INFORMATION FOR SEQ ID NO: 2: 



13 
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(i) SEQUENCE C3iARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

10 

AWGGTNSGGG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
TGCTTCTGCT ATCTTTGCTG CTGC 
(2) INFORMATION FOR SEQ ID N0:4: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ik) FEATURE: 

<A) NAME/KEY: Peptide 

(B) LOCATION: 12.. 13 

(D) OTHER INFORMATION t /label- Peptide 

/note- "The amino acid at position 12 may also be 
ieoleucine. The amino acid at position 13 nay 
also be aepartie acid." 

<xi) 8BQUBNCB DS5CRIFTX0N: 6BQ ZD K0:4s 

A8P Thr Lys Val Asn Phe Tyr Ala Trp Lye Arg Met Glu Val Oly 



1 5 



(2) INFORMATICMI FOR SEQ 10 110*5: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 25 base pairs 
(B) TYPES nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGAAGTTTGG CCGAGAAGTG GATGC 25 
(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs. 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
AAGAKGTACC TCTCCAGRAC TCG . 
(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
CTGCTCCACT CCGAACAMTC AC "22 
(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH X 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGYt both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
CTGGAGTGTC CATGGGACAG 20 
(2) INFORMATION FOR SEQ ID NO :9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 20 base pairs 

(B) TYPB: nucleic acid 

(C) STRARDEDNBSS: flingle 

(D) TOPOLOGYt both 

(xi) SEQUENCE DBSCRIPTZONi SEQ ID N0i9: 
AGGOGCGGXG ATGGGGGTGC 20 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(P) TOPOLOGY s both 

(xi) SEQUBNCE DESCRIPTION: SBQ ID NO: 10: 

TGGTA6CT6G GGGTGGGGTG T6CACACGGC AGCAGGATTG AATGAAGGCC AGGGAGGCAG 60 

CACCTGAGTG CTTGCATGGT TGGGGACAGG AAGGACGAGC TGGGGCAGAG ACGTGGGATG 120 

AAGGAAGCTG TCCTTCCACA GGCCACCCTT CTCCCTCCCC GCCTGACTCT CAGCCTGGCT 180 

ATCTGTTCTA GAATGTCCTG CCTGGCTGTG GCTTCTCCTG TCCCTGCTGT CGCTCCCTCT 240 
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120 
180 
240 
290 
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GGGCCTCCCA GTCCTGGGCG CCCCACCACG CCTOITCTGT GACAGC , 
(2) INFORMATION FOR SEO ID NO: 11: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
CTCGTAGTTG TTGGGTGGCG TGTGCACGCG GTCAGTGTAT TGAATGAAGG CAAAGGAGGC 
AGAGCCTGCG CCCTCGCAAG GTTGGGGTCG GAAACGGCTG GCTGGGGGCA AGACGGCCGG 
ATGGGTGAAC CTCTCCGTCT AAACCAACCC TTCTCCTGCA ATGCCCAGCC TCACATTCAG 
CCTGGCTCTC TTTCCTAGAA TGTCCTGCCC GGCTGCTTCT GCTATCTTTG CTGCTGCTTC 
CTCTGGGCCT CCCAGTCCTG GGCGCCCCCC CACGCCTCAT CTGTGACAGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single . 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID H0il2: 
GTAGCTCGGG GGGTCGGGTC TGCAOGCGGC C6CGGGATTG AATGAAGGCA AAGGAGGCAG 

AGCCTGAGCO CTCGCAAGGT TCGGGTCGGG AGGACTCGCG GGGCCAGAGC AGCGGGATGG 120 

CTGAACGTCC AGGTCCAAAC CACCCCTTCT CCTGCAAGTC CCGGCCTCAC ACTCAOCCCA 180 

C5CTCTCTCTC CTAGAATCTC TTCTGCTGCT TCTGCTGCCC TCCTTGCTGC TGCTTCCXCT 240 

CGGCCTCCCfA qTCCTGGGCC CCCCCACACA CCTCATCTGT GAOIGC 286 

(2) INFOkMATION FOR SEO ID N0il3t 

(i) SEQOBNCB CHARACTERI8Tld3: 
<A) LENGTH; 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS r single 

(D) TOPOLOGY s both 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCGTAGCTGT TGGGGTGGGG TGCGCACGAA GCCGCGGGAT TGAATtSAACG CGGAGGCAGA 60 

GCTGAGCGCT CCCAAGGTCC GGGTOGGGAG GGCTCACTGG GGGCT6CGCA GCGGGATGGG 120 

TCAACCTCCC GGTCCAAAGC A6CCCTTCTC CTG06AGAAT GCACCTCATC CGCAGCCCGT 180 

CTCTCTTTCC TAGACTGTAC TCCGCTGCTG CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT 240 



60 
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CTGGGCCTCC CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGC 288 
(2) INFORMATION FOR SEQ ID N0:14: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: 6EQ ID N0:14: 

TGGGGGTG6G GTGT6CAGGC GGCAGCAGGA TTGAATGAAG GCAGAGGAGA CAGACCCTGA 60 

GCGCTGGGAA GGTTGGGGGC AGGAGCCACT AGCTGGGGGC AGAGGAGGGG GATGCGTGAA 120 

CCTGCCCCTC CAAACCAACC CTCCCCTTCC CCTCCCTGCC TCACACTCAG CCCGGATCTC 180 

CTTTCCAGAA TGTCCTGCCC TGCTGCTTCT GCTGTCCCTG CTACTGCCTC CTCTGGGCCT 240 

CCCAGCCCTG GGCGCCCCTC CACGCCTCAT CTGTGACACC 280 
(2) INFORMATION FOR SEQ ID N0:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDBDNBS8: single 
(0) TOPOLOGYi both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

CAGGTCAGTT TGATTGGGTG GATGTGTGCC GCGAAGCAGC GGCCGGATTG AATGAAGGCA 60 

GGGGAGGCA6 AACCTGAACG CTGGGAA6GT G6G6GTCG6G CGCGACTAGT TGG6GGCAGA 120 

GGA6CG0GAT GTGTGAACCT GCCCCTCCAA ACCCACACAG TCAGCCTGGC ACTCTTTTCC ISO 

AGAATGTCCT G C CCTSTTCC rnT U C TOi t' TTTGCTGCT G CTTCCTCTGG ACCTCCCAGT 240 

CCTGGGOQCd 0CCCCT06CC TGATTTGTGA CA6C 274 
(2) . INFORMATION FOR SEQ ID H0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH I 269 base pairs 

(B) TYPE: nucleic acid 
ic) STRANDEDNESS: single 
(D) TOPOLOGY: both 

(xi) SEQUENCE DBSCRIPTIONi SEQ ID N0xl6: 

TG6TAGTTT6 GOGGGTGGTG TGTGCTCACC GCGGCGGCGG CGGGGATTGA ATGAAGGCCA 60 

AGGAG6CAGA ACCCGAGC6C TGGCAAGGTT CGGGGTGGGA 6CGACAAGCT G6GGGCAGAG 120 

(SAAC6G6ATG TGTGAACCTG CTCCTCTCAAA CACACTCAGC CTGGCACTCT TTTCTAGAAT 180 

GTCCTGCCCT GCT(3CTTCTG CTATCTTT6C TGCT6CTTCC CCTG(jGCCTC CCAGTCCTGG 240 

GCGCCCCCCC TCGCCTCATC TGTGACAGC 269 
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60 
120 
180 
240 
268 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGTAGTTTGG CGGGTGGGGT GTCCTCACCG CGGCGGCGGG GGGGATTGAA TGAAGGCCAA 
GGAGGCAGAA CCCGAGCGCT GGGAAGGTTC GGGGTGGGAG CGACAAGCTG GGGGCAGAGG 
AACGGGATGT GTGAACCTGC TGCTCCAAAC ACACTCAGCC TGGCACTCTT TTCTAGAATG 
TCCTGCCCTG CTGCTTCTGC TATCTTTGCT GCTGCTTCCC . CTGGGCCTCC CAGTCCTGGG 
CGCCCCCCCT CGCCTCATCT GTGACAGC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(Xi) SE<?JENCE DESCRIPTION: SEQ ID NO: 18: 

CGGTCGCTTG GGGGGTGGGG TGTGCAGCGC GGAGGGATTG AATGAAGGCC ACTCAGGCAG 60 

AGCCTAAGCA ATTGCAAGGT CGGGGTCAGC AGAGACTATA AGGGCAGAGG GCTCTCGCTG 120 

AGCCAACCGG CCCCTGCAGT CCCACGGCCC CCTCCCCTCT CGGCCTCACA CTCA6CCTGC 180 

CTTTCTTCCA GAAOGTCCCA CCCTCCTGCT TTTACTCTCC TT6CTACTGA TTCCTCTGGG 240 

CCTCCCA6TC CTCTGTGCTC CCCCACGCCT CATCTGCGAC AGT 283 
(2) IMPORMATIOM PGR 8BQ ID K0il9i 

(i)' SEQUENCE CHARACTERISTICS: 

(A) UWGTH: 283 base pairs 

(B) TYPE: nucleic acid 

* (C) STRANDEDNESS X single 
(D) TOPOLOGY: both 

(xi) SEQOENCB DESCRIPTIONS 6BQ ID NO s 19 s 
ATGGTCGCTT GGGGG6TGGT GTGTGCAGCG CCGGGAGATA GAATGAAGGC CACTCAGGCA 60 
GAGCCTAAGC AGTTGCAAGG TCGGGGTCAG CAGAGACTGG AAGGGCAGAG GAGCCTCGCT 120 
GTGCAAACC6 GTCTCTGGTC GTCCCACGGC CCTCCCCTCC CAGCCTCACA CCCAGCCT6C 180 
CTTTCTTCTA GAACGTCCAA CCCTGCTGCT TTTGCTGTCT TTGCTCCTGC TTCCCCTGGG 240 
CCTCCCAGTC CTCTGCGCTC CCCCACGCCT CATCTGCGAC AGC 283. 
(2) INFORMATION FOR SEQ ID N0:20: 
(i) SEQUENCE CKARACTBRISTICS : 
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(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQDENCE DESCRIPTION; SEQ ID NO: 20: 
GGGTGGTOGC GATGAATGAA GCAGCAGCTA GGTGGGGGGC GGTGAAGTAC AGCC6TCTCA 
GAGTGTCTTC TCTCTGCCCT GGCCTCCCAG CTGGCCCCCC CTCATTGGAC AG 
(2) INFORKATION FOR SEQ ID N0:2I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
.(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0;21: 

AGGCGCGGAG ATGGGGGTGC ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCCCTGCT 60 

GTCGCTCCCT CTGG6CCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT CTGACAGCCG 120 

AGTCCTGGAG AGCIACCTCT TGGAGGCCAA GGAGGCCGAG AATATCAOGA CGGGCTOTGC 180. 

TGAACACTGC AGCTTGAATG AGAATATCAC TGTCCCAGAC ACCAAAC5TTA ATTTm 240 

CTGGAAGAGG ATGGA(3GTCG GGCAGCA6GC CGTAGAAGTC TGGCAGGGCC TGGCCCTGCT 300 

GTOJGAAGCT GTCCTGCGGG GCCAGGCCCT GTTGGTCAAC TCTTCCCAGC CGTG^ 360 

CCTGCAGCTG CATGTGGATA AAGCCGTCAG TG6CCTTCGC AGCCTCACCA CTCTGCTTCG 420 

GGCTCTOGGA 6CCCAGAAAG AAGCCATCTC CCCTCCAGAT GCGGCCTCAG CTGCTCCACT 480 

CCGAACAATC ACTGCTGACA CTTTCCGCAA ACTCTTCCGA GTCTACTCCA ATTTCCTCCG 540 

GGGAAAGCTG AAGCTGTACA CAGGGGAGGC CTCCAGGACA GGGGACAGAT GACCAGGTOT 600 

GTCCACCTGG GCATATCCAC CACCTCCCTC ACCAACATTG CTTCTGCCAC ACCCTCCCCC 660 

GGCCACTCCT GAACCCOGTC GAGGGCCTCT CAGCTCAGCG CCAGOCTGTC CCaTG<»CAC 720 

725 

TCCAG 

(2) INFORMATION FOR SEQ ID 110:22: 

(i) SEQOBNCB CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIOGTi both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 

ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCTCTCGT GTCGCTCCCT CTGGGCCTCC 60 

CAGTCCCGGG CGCCCCACCA CGCCTCGTCT CTGACAGCCG AGTCCTGGAG AGGTACCTCT 120 

TGGAGGCCAA GGAGGCCGAG AATGTCACGA TGGGCTGTTC CGAAAGCTGC AGCTTGAATG 180 
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AGAATATCAC CGTCCCAGAC ACCAAAGTTA ACTTCTATGC CTGGAAGAGG ATAGAGGTCG 240 

GGCAGCAGGC TGTAGAAGTC TGCCAGGGCC TGGCCCTGCT CTCAGAAGCT GTCCTGCGCG 300 

GCCAGGCCGT GTTGGCCMVC TCTTCCCAGC CTTTCGAGCC CCTGCAGCTG CACATGGA^^ 360 

AAGCCATCAG TGGCCTTCGC AGCATCACCA CTCTGCTTCG GGCGCTGGGA GCCCAGGAAG 420 

CCATCTCCCT CCCAGAT6CG 6CCTCGGCTG CTCCACTCCG AACCATCACT GCTGACACTT 4 80 

TCTGCAAACT CTTCCGAGTC TACTCCAATT TCCTCCGGGG AAAGCTGAAG CTGTACACGG S40 

GGGAGGCCTG CAGGAGAGGG GACAGATGAC CAGGTGCGTC CAGCTGGGCA CATCCACCAA 600 

CTCCCTCACC AACACTGCCT GTGCCACACC CTCCCTCACC ACTCCCGAAC CCCATCGA6G SCO 

£fil 

GGCTCTCAGC TAAGCGCCAG C 

(2) INFORMATION FOR SEQ ID NOs23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) T0P0UX5Y: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

CCGAACGTCC CACCCTGCTG CTTTTACTCT CCTTGCTACT GATTCCTCTG GGCCTCCCAG €0 

TCCTCTGTGC TCCCCCACGC CTCATCTGCG ACAGTCGAGT TCTGGAGAGG TACATCTTAC 120 

AGGCCAAGGA GGCAGAAAAT GTCACGATG6 GTTGTGCAGA AGGTCCCAGA CTGAGTGAAA 180 

ATATTACAGT CCCAGATACC AAA6TCAACT TCTATGCTTG GAAAAGAATG GAGGTGGAAG 240 

AACACGCCAT AGAAGTTTGG CAAGGCCTGT CCCTGCTCTC AGAAGCCATC CTGCAGGCCC 300 

AG6CCCTGCT AGCCAATTCC TCCCAGCCAC CAGAGACCCT TCAGCTTCAT ATAGACAAAG 360 

CCATCAOTGG TCTACGTAGC CTCACTTCAC TGCTTCGGGT ACTGGGAGCT CAGAAGGAAT 420 

TGATCTCGCC TCCAGAXACC ACCCCACCTG CTCGACTCCG AACACTCACA GTGGATACTT 460 

TCTGCAAOCT CTTCCCGGTC TACGCCAACT TCCTCCGGGG GAAACTGAA6 CTGTACACGG 540 

6AGAGGTCT6 CAGGAOIGGG GACAGGTGAC ATGCTGCTGC CACCGTGGTG GACCGAOGAA 600 

CTTGCTCCCC GTCACT GT GT CATGCCAACC CTCCACCACT CCCAACCCTC ATCAAACGGG 660 

cng 

TCATTAOCTT CTTACCAGT 

(2) IHPORMATIGN FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO: 24: 
CCGAACGTCC CACCCTGCTG CTTTTACTAT CCTTGCTACT GATTCCTCTG GGCCTCCCAG 60 
TCCTCTGC6C TCCCCCACGC CTCATTTGOG ACAGTCGCGT TCTGGAGAGG TACATCTTGG 120 
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AGGCCAAGGA GGCAGAAAAT GTCACAATGG GCTGTGCAGA AGGTCCCAGA CTGAGTGAGA X80 

ATATTACCGT CCCAGATACC AAAGTCAACT TCTACGCTTG GAAAAGAATG AAGGTGGAAG 240 

AACAGGCTGT AGAAGTTTGG CAAGGCCTGT CTCTGCTCTC AGAAGCCATC CTGCAGGCCC 300 

AGGCTCTGCA GGCCAATTCC TCCCAGCCAC CAGAGAGTCT TCAGCTTCAT ATAGACAAAG 360 

CCATCAGTGG GCTACGTAGC CTCACTTCAC TGCTTCGGGT GCTGGGAGCT CAGAAGGAAT 420 

TGATGTCGCC TCCAGACGCC ACCCAAGCCG CTCCACTCCG AACACTCACA GCGGATACTT 480. 

TCTGCAAGCT CTTCCGGGTC TACTCCAACT TCCTCCGGGG XSAAACTGAAG CTGTACACGG .540 

GGGAGGCCTG CAGGAGAGGG GACAGGTGAC CTGCCACTGC CGTGTACCCG CCAACTCGCT €00 

CACCGTCACT GTGTCACGCC AACCCTCCAC CACTCCCAAC CCTCATCAAA CGGGGTTGTT 660 

TGTTACCTTC TTACCGGC 678 
(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUEKGE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: fiingle 
(0) TOPOLOGY; both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0S2S: 

GCGACTGTAC TCCGCTGCTG CTGCTGCTGC TGTCTCTTCT GCTGTTTCCT CTGG6CCTCC 60 

CAGTCTTGGG CGCCCCCCCA CGCCTCATCT GTGACAGCCG AGTCCTGGAG AGGTACATCC 120 

T6GACGCCAG GGAGGCCGAA AATGCCACGA TGGGCTGTGC AGAAGGCTGC AGCTTGAGfTG 180 

AGAATATCAC TGTCCCAGAC ACCAAGGTTA ACTTCTATGC CTG6AAGA0G ATGGAGGTCC 240 

AGCAGCAGGC TCTGGAAGTC TGGCAGGGCC TGGCTCTGCT CTCAGAA6CT ATCTTTCOGG 300 

GCCAG6CCCT ACCGGCCAAC GCATCCCAGC CATGCGAGGC CCTGCGGrTO CACCTG6ACA 360 

AAGCTGTCAG CGGCCTCOGC AGTCTCACCT CCCTGCTTCG GGCGCTGGGA GCCCAfiAAAG 420 

AAGCCATCCC* CCTTCCAGAT GCAACCCCCT CCGCAGOCCC ACTCOGAATA TTCACTGTTO 480 

ATGCTTTGTC CAAGCTCTTC C(»ATCTACT CCAATTTCCT GAGGOGAAAG CTGACGCTtST 540 

ACACAGGGGA GGCCTGCAGG AGAGGGGACR GGTGACCCAG TGCTATCACC CCGGGCACGT 600 

CCATCACCXC 6CTCACCATC ACTGCCTAC6 CCATGCCTTC CACGCCGCCA CTCCCAACCC 660 

CTOTCGACGA CGGGCCACCA GCTCAGT 687 

(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base, pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; both 



(xi) SEQUENCE DESCRIPTION: SEQ ZD MO: 26: 
GAATGTCCrc CCCGGCItSCT TCTGCTATCT TTGCT6CTGC TTCCTCTGGG CCTCCCA6TC 



60 
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180 

240 

300 

360 

420 

480 

540* 

600 

6€0 

€88 



CTGGGCGCCC CCCCACGCCT CATCTGTGAC AGCCGAGTCC TGGAGAGGTA CATCTTGGAG 
GCCAAGGAGG GCGAAAATGC CACGATGGGC TGTGCCGAAA GCTGCAGCTT CAGTGAGAAT 
ATCACTGTCC CAGACACCAA GGTTAACTTC TATCCCTGGA AGAGGATGGA GGTCCAGCAG 
CAGGCCATGG AGCTCTGGCA GGGTCTGGCC CTGCTCTCAG AAGCCATCCT GCAGGGCCAG 
GCCCTGTTGG CCAACTCCTC CCAGCCATCT GAGGCCCTGC AGCTGCATGT GGACAAAGCT 
GTCAGCGGCC TGCGCAGCCT CACCTCCCTG CTTCGGGCGC TGGGAGCCCA GAAGGAAGCC 
ATCCCCCTTC CAGAC6CATC CCCCTCCTCT GCCACCCCAC TCCGAACATT TGCTGTTGAT 
ACTTTGTGCA AACTTTTCCG CAACTACTCC AAlTrCCTGC GGGGAAAGCT GACGCTCTAC 
ACTGGAGAGG CCTGCAGGAG AAGGGACAGG TGACTTGGTG CTCCCACCCG GGGCATGTCC 
ACCACTTCGC TCACCACCAC TGCCTGTGCC ATGCCTTCTG CACCTCCACT CCCAACCCCC 
GCTGAGGGGC CATCAGCTCA GCGCCAGC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSi single 

(D) TOPOLOGY: both 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
6AATCTCCTG CCCTGCTGCT TCTGCTATCT TTGCTGCTGC TTCCCCTGGG CCTCCCAGTC 
CTGGGCGCCC CCCCTCGCCT CATCTGTGAC AGCCGAGTCC TGGAGAGGTA CATTCTGGAG 
GCCAGGGAGG CCGAAAATGT CACGATGGGC TGTGCTGAAG GCTGCAGCTT CAGTGAGAAT 
ATCACTCTCC CAGACACCAA GGTCAACTTC TATACCTGGA AGAGGATGGA CGTCGGGCAG 
CAGGCTGTGG AAGTCTGGCA GGGCCTCGCC CTGCTCTCAG AAGCCATCCT GCGGG6CCAC 
GCCCTGCTGG CCAACTCCTC CCAGCCATCT GAGACCCTGC AGCTGCATGT GGATAAAGCC 
6TCAGCAGCC TGCGCAGCCT CACCTCCCTG CTTCGGGCAC TGGGAGCCa GAA^^ 
ACCTCCCTTC CAGAGGCAAC CtCTGCTGCT CCACTCCGAA CATTCACICT CGATACTTTG 
TGCAAACTTT TCCGAATCTA CTCCAACTTC CTGCGGGGAA AGCTGACGCT GTACACAGGG 540 
GAGGCCTGCC GAAGAGGAGA CAGGTGACCA G6TGCTCCTA CCCCGGGCAT GTCCACCACC 
TCACTTACCA CCAATGCCTG TGCCACGCCC TCTGCACCAC CACTCCTGAC CCCTGTOGGG 
GGTGATCAGC TCAGCAOCAG C 

(2) INFORMATION FOR SEQ ID NO: 28: 

(1) SBtyJENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOs28: 
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Met Gly Val His Glu Cye Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 

1 '5 

Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 .25 

(2) INFORMATION FOR SEQ ID NOs29: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
is) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gly Val Hia Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
1 5 . 10 " 

Val Ser Leu Pro Leu Gly Leu Pro Val Pro Gly Ala Pro Pro 
20 25 . 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY! both 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0!30: 

Met Gly Val Pro Glu Arg Pro Thr Leu Leu Leu Leu Leu Ser Leu Leu 
1 5 10 ^* 

Leu lie Pro Leu Gly Leu Pro Val Leu Cys Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0_ amino acids 

(B) TYPE: amixib acid 
(D) TOPOLOGY: both 



(xD'SEQUBNCE DESCRIPTION: SEQ ID N0:3l: 

Met Gly Val Arg Asp Cye Thr Pro Leu Leu Leu Leu Leu Leu Ser Leu 
. ^ ' 5 10 15 

Leu Leu Phe Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 30 

(2) INPORMATICm FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTO: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY s both 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Glu cys Pro Ala Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 
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Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 25 amino acids 
(B) TYPE: amino acid' 
(D) TOPOLOGY: both 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
. Glu eye Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leu Pro Pro Leu 
1 5 1^ 

Gly Leu Pro Ala Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQOEHCE DESCRIPTION: SEQ ID HOi34i 

Glu eye Pro Ala Leu Leu Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 

1 5 . 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
^ 20 25 

(2) IHPORMATIOH FOR SEQ ID 110:35: 

(i) 8BQ0EHCB CHARACTERISTICS: 

(A) LBNGrTU: 25 amino acids 

(B) TYPB: anino acid 
•(D) TOPOLOOYi both 



(xi) 6B0DBNCB DBSCRIPTZONx 8BQ ID NOtBSs 

Glu Cya Pro Ala Leu Phe Leu Leu Leu Ser Leu Leu Leu Leu Pro Leu 

A6P Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SBQ ID NO:36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SBQUBMCB DESCRIPTION: SEQ ID NO:36: 
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Glu Cye Leu Leu Leu Leu Leu Leu Pro Ser Leu Leu Leu Uu Pro Leu 
1 5 10 15 

Gly Leu Pro Val Leu Gly Ala Pro Pro 
20 25 

(2) INFORMATION FOR SEO ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

Ala Pro Pro Arg Leu He Cye Aep Ser Arg Val Leu Glu Arg Tyr Leu 
1 5 10 ^ . 

Leu Glu Ala Lys Glu Ala Glu Asn He Thr Thr Gly Cys Ala Glu His 
20 25 30 

Cye Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lye Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 €0 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Val Asn Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu His Val Aep 
85 90 95 

Lye Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Ser Pro Pro Aep Ala Ala Ser Ala Ala 
115 120 125 

Pro Leu Arg Thr. He Thr Ala Aep Thr Phe Arg Lye Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lye Leu Lye Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

Cys Arg Thr Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SBQOENCB C8ARACTBRISTICS: 

(A) LENGTHS 165 amino acids 

(B) TCPBt amino acid 
(D) TOPOLOGY: both 



(xi) 6BQUENCB DESCRIPTION: SEQ ID NOi38: 

Ala Pro Pro Arg Leu He Cye Aep Ser Arg Val Leu Glu Arg Tyr Leu 
1 5 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

CVB Ser Leu Asn Glu Asn lie Thr Val Pro Asp Thr Lye Val Asn Phe 
^35 40 45 
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Tyr Ala Trp Lye Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Uu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Val 
65 70 75 80 

Leu Ala Aan Ser Ser Gin Pro Phe Glu Pro Leii Gin Leu His Met Aep 
85 90 95 

Lye Ala lie Ser Gly Leu Arg Ser He Thr Thr Leu Leu Arg Ala Leu 
^ 100 105 110 

Gly Ala Gin Glu Ala He Ser Leu Pra Aep Ala Ala Ser Ala Ala Pro 
US 120 125 

Leu Arg Thr He Thr Ala Aep Thr Phe Cys Lye Leu Phe Arg Val Tyr 
130 135 140 

Ser Aan Phe Leu Arg Gly Lye Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
145 ISO 155 160 

Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO; 39 i 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY s both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Ala Pro Pro Arg Leu Val Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
1 5 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cys Ser Glu Ser 
20 25 30 

Cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg He Glu Val Gly Gin Gin Ala Val Glu Val Trp 
SO 55 €0 

Gin «ly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Val 
€5 70 75 ao 

Leu Ala Asn Ser Ser Gin Pro Phe Glu Pro Leu Gin Leu His Met Asp 
85 90 95 

Lys Ala He Ser Gly Leu Arg Ser He Thr Thr Leu Leu Arg Ala Leu 
' 100 105 110 

Gly Ala Gin Glu Ala He Ser Leu Pro Asp Ala Ala Ser Ala Ala Pro 
115 120 125 

Leu Arg Thr He Thr Ala Asp Thr Phe Cys Lys Leu Phe Arg Val Tyr 
130 "5 140 

Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys 
145 150 155 160 

Arg Arg Gly Asp Arg 
165 
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(2) INFORMATION FOR SEQ ID KO:40: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Ala Pro Pro Arg Leu lie Cye Aep Ser Arg Val Leu Glu Arg Tyr lie 
1 5 10 15 

Leu Glu Ala Lye Glu Ala Glu Aen Val Thr Met Gly Cye Ala. Glu Gly 
20 25 30 

Pro Arg Leu Ser Glu Aen lie Thr Val Pro Asp Thr Lye Val Aen Phe 
35 40 45 

Tyr Ala Trp Lye Arg Met Glu Val Glu Glu Gin Ala lie Glu Val Trp 
50 55 €0 

Gin Gly Leu Ser Leu Leu Ser Glu Ala He Leu Gin Ala Gin Ala Leu 

65 70 ^s eo 

Leu Ala Aen Ser Ser Gin Pro Pro Glu Thr Leu Gin Leu Hia He Aop 
85 90 ^5 

Lve Ala He Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 
^ 100 105 110 

Gly Ala Gin Lys Glu Leu Met Ser Pro Pro Asp Thr Thr Pro Pro Ala 
^ lis 120 125 

Pro Leu Arg Thr Leu Thr Val Asp Thr Phe Cye Lys Leu Phe Arg Val 
130 135 140 

Tyr Ala Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Val 
145 150 155 1€0 

Cys Arg Arg Gly Asp Arg 
1€S 

(2) INFORMATION FOR SEQ ID NO :41s 

(i) SEQUENCE CHARACTERISTICS: . . 

(A) LENGTH: 1S6 amino acids 
• {B) TYPE: amino acid 
(D) TOPOLOGY s both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr lie 
X S 10 15 

Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met Gly Cye Ala Glu Gly 
20 25 30 

Pro Arg Leu Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 . 45 

Tyr Ala Trp Lye Arg Met Lys Val Glu Glu Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ser Leu Leu Ser Glu Ala He Leu Gin Ala Gin Ala Leu 
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70 75 80 



<5 

Gin Ala A«n Ser Ser Gin Pro Pro 6lu Ser Uu Gin Leu Hia lie Asp 
85 90 . 

Lya Ala lie Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Val Leu 



100 



Gly Ala Gin Lys Glu Leu Met Ser Pro Pro. Asp Ala Thr Gin Ala Ala 

115 120 
Pro Leu Arg Thr Leu Thr Ala Asp Thr Phe Cys Lys Leu Phe Arg Val 



130 



Tyr ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 



145 ISO 15S 

Cys Arg Arg Gly Asp Arg 
1€5 

(2) INFORMATION FOR SEQ ID NO:42! 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr He 

1 5 ^ 

Leu Glu Ala Arg Glu Ala Glu Asn Ala Thr Met Gly Cys Ala Glu Gly 
20 25 

cys ser Phe Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 • . 

Tyr Ala Irp Lys Arg Met Glu Val Gin Gin Gin Ala Leu Glu Trp 

' «A cc 60 



so 



Gin Gly Leu Ala Leu Leu. Ser Glu Ala He »e Arg Gly Gin Ala Leu 
65 " . * 

Pzi> Ala Aan Ala Ser Gin Pro Cys Glu Ala teu Arg Uu Hia Val Asp 
85 90 " 

Lye Ala Val Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 

. ' 100 

Gly Ala Gin Lys Glu Ala He Pro Leu Pro Asp Ala Thr Pro 8cr Ala 
115 ^20 

Ala Pro Leu Arg He Phe Thr Val Asp Ala Leu Ser Lye Leii Phe Arg 
130 140 

He Tyr Ser Asn Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly Glu 
145 150 . 155 160 

Ala Cys Arg Arg Gly Asp Arg 

165 

<2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Ala Pro Pro Arg Leu lie Cys Aep Ser Arg Val Leu Glu Arg Tyr He 
1 5 .10 15 

Leu Glu Ala Lys Glu Gly Glu Asn Ala Thr Met Gly Cys Ala Glu Ser 
20 25 30 

cys Ser Phe Ser Glu Aen He Thr Val Pro Aep Thr Lye Val Aen Phe 
35 *S 

Tyr Ala Trp Lyo Arg Met Glu Val Gin Gin Gin Ala Met Glu Val Trp 

50 .55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Leu Gin Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Aen Ser Ser Gin Pro Ser Glu Ala Leu Gin Leu His Val Aep 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Ser Leu Leu Arg Ala Leu 
100 105 110 

Gly Ala Gin Lys Glu Ala He Pro Leu Pro Aep Ala Ser Pro Ser Ser 
115 120 125 

Ala Thr Pro Leu Arg Thr Phe Ala Val Aep Thr Leu Cys Lye Leu Phe 
130 135 140 

Arg Aen Tyr Ser Aen Phe Leu Arg Gly Lys Leu Thr Leu Tyr Thr Gly 
145 150 155 160 

Glu Ala Cye Arg Arg Arg Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPBs amino acid 
(D) TOPOLOGYs both 



(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr He 
15 10 15 

Leu Glu Ala Arg Glu Ala Glu Asn Val Thr Met Gly Cys Ala Glu Gly 
20 25 30 

Cys Ser Phe Ser Glu Asn He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Thr Trp Lys Arg Met Asp Val Gly Gin Gin Ala Val Glu Val Trp 
SO 55 . 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala He Leu Arg Gly Gin Ala Leu 
65 70 75 80 

Leu Ala Asn Ser Ser Gin Pro Ser Glu Thr Leu Gin Leu His Val Asp 
85 90 95 
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Lye Ala Val Ser Ser Leu Arg Ser Leu Thr Ser Leu Leu Arg AU Leu 

. .. 100 

Gly Ala Gin Lye Glu Ala Thr Ser Leu Pro .Glu Ala Thr Ser Ala Ala 
lis 120 12b 

Pro Uu Arg Thr Phe Thr Val Asp Thr Leu Cya Lye Leu Phe Arg He 
130 135 . 

Tyr ser Asn Phe Leu Arg Gly Lya Leu Thr Leu Tyr Thr Gly Glu Ala 
145 ISO 155 ibo 

eye Arg Arg Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
1 5 10 " 

Leu Glu Ala Lys Glu Ala Glu Aen He Thr Thr Gly Cya Ala Glu Hia 
20 25 30 

cys Ser Leu Asn Glu Asn He Thr Val Pro Asp Thr Lys Val Aen Phe 
35 40 *5 

Tyr Ala Trp Lys Arg Met Glu Gly Val Gin Gin Ala Val Glu Val Trp 
50 55 60 

Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 ''0 . 

Leu Val Aen Ser Ser Gin Pro Trp Glu Pro Leu Gin Leu Hie Val Aep 
65 .90 95 

Lye Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
- . 100 105 110 

Gly Ala Gin Lye Olu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
lis 120 1*5 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 1*0 

Tyr ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 ISO 155 160 

Cye Arg Thr Gly Asp Arg 
165 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acide 

(B) TYPE: amino add . 
(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 



Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu Xaa Xaa Ala Lye 



1 5 
(2) INFORMATION FOR SEQ ID N0:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu Ala Val 
1 5 10 15 

Leu Arg 



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: 8EQ ID NOi48: 



Pro Leu Gin Leu Hie Val Aep Lye Ala Val Ser Gly Leu Arg Ser Leu 
1 5 10 15 

Thr Thr 



(2) INFORMATION iPOR 8BQ ID ilOi49: 

(i) SBQUBNCB CHARACTBRISTICSi 
(A) LENGTH t 21 emino aeide 
* (B) TYPES eiaino acid 
(D) TOPOLOGY: both 



(xi) SBQUENCE OBSCRIPTIW: SEQ ID Il0t49: 

Thr He Thr Ala Aep Bir Phe Arg Lys Leu Phe Arg Val Tyr Ser Aen 

I 5 10 15 . 

Phe Leu Arg Gly Lye 
20 

(2) INFORMATION FOR SEQ ID HO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acide 

(B) TYPE: amino acid 
(D) T0POL0(3Y: both 



(xi) SB(3!UENCE DESCRIPTION: SEQ ID K0:50: 



wo 94/24160 



PCT/US94/04361 



-90- 



Glu Asn lie Thr Thr Gly Cys Ala Glu His Cys Ser Leu Aan Glu Asn 
1 5 1° 

He Thr Val Pro Aep Thr Lya Val Asn Phe Tyr Ala Trp Lye Arg Asn 



20 2S 

Glu Val Gly Gin 
35 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 27 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:51:. 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu Ala lie Ser 
1 5 

Pro Pro Asp Ala Ala Ser Ala Ala Pro Leu Arg 
20 25 

(2) INFORMATION FOR SEQ ID N0:52i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTIMi SEQ ID N0iS2x 

Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lye Leu He Ser Glu 
X 5 10 

Glu A8P Leu Clu Ala He Ser Pro Pro Asp Ala Ala Ser A^a Ala Pro 
20 ^' 

Leu Arg 

(2) INFORMATION FOR SEQ ID 1K>:S3: 

(ir SEQUENCE CHARACTERISTICS X 

(A) LBNGIH: € amino acide 

(B) TYPE: amino add 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
Cys Arg Thr Gly Asp Arg 

(2) INFORMATION FOR SBQ ID HOs54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
Lye Aep GIu Leu 

a 

(2) INFORMATION FOR SEQ ID NO:555 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: € amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

His His His Hie His His 
1 5 

(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:S6: 

Met Gly His Ser Ser Gly Hie lie GIu Gly Arg Hie Met Ala Pro Pro 
I 5 10 



(2) INFORMATION FOR SEQ ID N0:S7i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SBQUBNCB DBSCRIPTZON: SEQ ID NO:57s 

Val'Leu Glu Arg Tyr Leu Leu GIu Ala Lye Glu Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SBC^IENCB CHARACTERISTICS: 

(A) LENGTH I 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID IK>:S8: 

• Arg Thr lie Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser 
I 5 10 1^ 

Aan Phe Leu Arg . 

20 

(2) INFORMATION FOR SEQ ID NO : 59 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOUXSY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:S9: 
Gly Lys Leu Lye Leu Tyr Thr Cly Glu Ala Cys 
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Claims: 



1 1. An erythropoietin mutein comprising native erythropoietin with 

2 one or more of the following modifications: 

3 (a) replacement of the amino acid at portion 20 with a first 

4 substitute amino acid; 

5 (b) replacement of the amino acid at position 49 with a 
5 second substitute amino acid; 

7 (c) replacement of the amino acid at position 73 with a 

8 third substitute amino acid; 

9 (d) replacement of the amino acid at position 140 with a 

10 fourth substitute amino acid; 

11 (e) replacement of the amino acid at position 143 with a 

12 fifth substitute amino acid; and 

13 (f) iq)lacement of the amiro acid at position 146 with a 

14 dxth substitute amino acid; 

15 (g) replacement of the amino acid at position 147 with a 

16 seventh substitute amino add; 

17 (h) replacement of the amino add at portion 154 with a 
Ig dghth substitute amino add; 

19 wl«eidn ssud erytbropoiedn mutein exhibits enhanced biological activiQr 

20 compared to said native erytiiropoietin. 

1 2. The erytiuppoietin mutein of claim 1 oomprisiiig native 

2 eiyduopoietin widi one of ssud modifications. 

1 3. The eiydiropoietin mutein of dmm 1 wherein said first, second, 

2 tiiiid, fouitii, fifUi, sixdi, sevemh or eighdi substitute amino acids are selected 

3 bom tiie group con»sting of alanine, serine, tiireonine, and glydne. 
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1 4, The erythropoietin mutein of claim 3 wherein said first 

2 substitute amino acid is serine, and/or said second, third, fourth, fifth, sixth, 

3 seventh and/or eighth substitute amino acids are alanine. 

5. The erythropoietin mutein of claim 2 wherein the amino acid at 
position 20 is replaced witii tfie amino acid alanine. 

1 6. The erythropoietin mutein of claim 2 wherein the amino acid at 

2 position 49 is replaced witii the amino acid serine. 

3 7. The erytiiropoietin mutein of claim 2 wherein the amino acid at 

4 position 73 is replaced witii die amino acid glycine. 

5 8. The erytiiropoietin mutein of claim 2 wherein die amino acid at 

6 position 140 is replaced witii tiie amino acid alanine. 

7 9. Theerytiuopoietin mutein of claim 2 wherein the amino acid at 

8 position 143 is replaced witii tiie amino acid alanine. 

9 10. The erytiirqwietin mutein of claim 2 wherein tiic amino acid at 

10 position 146 is replaced witii tiie amino acid alanine. 

IL The erytiiropoietin mutein of claim 2 wherein die amino acid at 
position 147 is i^laced widi die amino acid alanine. 

1 12. The erytiiropoietin mutein of claim 2 wherein die amino acid at 

2 position 154 is r^laced mfb die amino acid alaiune. 



1 13. A DNA molecule encoding die erytiiropoietin mutein of 

2 cl^ml. 
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1 14. A recombinant DNA construct comprising the DNA molecule 

2 of claim 13. . 

1 15. A vector comprising Ae DNA construct of clam 14 which is 

2 capable of expressing said erythropoietin mutein in a host cell. 

1 16. A host cell comprising the DNA molecule of cWm 13 capable 

2 of expressing the erythropoietin mutein encoded by said DNA molecule. 

1 17. The host cell of claim 16 wherein said host cell is capable of 

2 glycosylating said erythropoietin mutein. 

1 18. The host cell of claim 17 wherein said host cell is a mammalian 

2 cell. 

1 19. The host cell of claim 18 wherein said host ceiris selected from 

2 the group consisting of Chinese Hamster Ovary cells, Cos7 oells,Cosl cells, 

3 baby hamster kidney cells, and CVl cells. 

1 20. A method of stimulating eryduocyte production in a subjea, 

2 wherein ssud method comprises administering to said subject an cfficadous 

3 amountof the eiytfarqx>iedn mutein of claim 1. 

1 21. A medKxl of stimulating ciytiirocytc production in a subject, 

2 said medMxl compriang admimstering to said subject an efficacious amount of 

3 an eiytliropotetin mutein comprising native erytfiK^ietin wtl» die amino add 

4 leadue at podtion 143 iqilaced by an alanine residue, wlierein said efficacious 

5 amount is less dian the corresponding amount of native eiyduqKnetin. 
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1 22. A method of inducing the proliferation of a cell culture 

2 responsive to erythropoietin, wherein said method comprises contacting said 

3 cell culture with the erythropoietin mutein of claim 1. 

1 23. A method of inducing the proliferation of a cell culture 

2 responsive to erythropoietin, wherein said metfiod comprises contacting said 

3 cell culnirc witfi an efficacious amount of an erythropoietin mutein comprising • 

4 native erydiropoictin with the amino acid residue at position 143 replaced by 

5 an alamne residue, wherein said efficacious amount is less Uian the 

6 corresponding amount of native eiydiropoietin. 

1 24. The mediod of claim 18 or 19 wherein said cell culmre 

2 comprises cells selected from die group consisting of murine spleen cells, 

3 murine erythroleukemia cells, and human UT-7/Epo cells. 

1 25. A metiiod for obtaining erytiiropoietin muteins widi enhanced 

2 biological activity relative to native erytiiropoietin, said mediod comprising the 

3 stq)s: 

4 (a) prq)aring erytiiropoietin muteins comprising native erytiiropoietin 

5 witii a single amino acid substitution, said substitution occurring at a position 

6 selected ftom die group consisting of positions 48-52, 151-160, and poations 

7 predicted to reside on die external surface of helices A and D; 

8 ' (b) assaying die biological activity of the ciyflinqwietin muteins from 

9 step (a) over a range of dosages; and 

10 (c) identifying tiiose erytiiropoietin muteins witii enhanced biological 

1 1 activiqr relative to native erytiiropoietin. 
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